снаттер 38

TESTING THE FOUNDATIONS OF RELATIVITY

Provando e riprovando
(Verify the one and disprove the other)

§38.1. TESTING IS EASIER IN THE SOLAR SYSTEM THAN IN REMOTE SPACE

For the first half-century of its life, general relativity was a theorist's paradise, but an experimentalist's hell. No theory was thought more beautiful, and none was more difficult to test.
The situation has changed. In the last few years general relativity has become one of the most exciting and fruitful branches of experimental physics. A half-century late, the march of technology has finally caught up with Einstein's genius-not only on the astronomical front, but also in laboratory experiments.
On the astronomical front, observers search for phenomena in which relativity is important, and study them: cosmology, pulsars, quasars, gravitational waves, black holes. Unfortunately, in pulsars and quasars, and in the sources of cosmological radiation and gravity waves, gravitational effects are tightly interwoven with the local hydrodynamics and local plasma physics. There is little hope of separating the several effects sufficiently sharply to get clean tests of the nature of gravity. Instead, astrophysicists must put the laws of gravity into their calculations along with all the other laws of physics and the observational data; and they must then seek, as output, information about the doings of matter and fields "way out there."
Thus, for clean tests of general relativity one turns to the laboratory-but to a laboratory that is much larger today than formerly: a laboratory that includes the entire solar system.
Clean tests of general relativity are currently confined to solar system
Capabilities of technology in 1970's
In the solar system all relativistic effects are tiny. Nonetheless, some of them are measurable with a precision, in the 1970 's, of one part in 1,000 of their whole magnitude or better (see Box 38.1).

§38.2. THEORETICAL FRAMEWORKS FOR ANALYZING TESTS OF GENERAL RELATIVITY

There are now possible many experiments for testing general relativity. But most of them are expensive; very expensive. They involve atomic clocks flown on space-
Box 38.1 TECHNOLOGY OF THE 1970's CONFRONTED WITH RELATIVISTIC PHENOMENA
Quantity to
Magnitude of
be measured
Magnitude of be measured| Magnitude of | | :--- | | be measured |
Precision of a one-day
relatistic effects measurement in the early 1970's
Quantity to "Magnitude of be measured" Precision of a one-day relatistic effects measurement in the early 1970's | Quantity to | Magnitude of <br> be measured | Precision of a one-day | | :--- | :--- | :--- | | relatistic effects | measurement in the early 1970's | |
Angular separation of two sources on the sky
Solar deflection of starlight
(1) if light ray grazes edge of Sun, 1 .75 1 .75 1^('').751^{\prime \prime} .751.75
(2) if light ray comes in perpendicular to Earth-sun line,
0 .004 0 .004 0^('').0040^{\prime \prime} .0040.004 measurement in the early 1970's
(a) With optical telescope, 1 1 ∼1^('')\sim 1^{\prime \prime}1
(b) Angular separation of two quasars with radio telescope (differential measurement from day to day, not absolute measurement)
in 1970 , 0 .1 in mid 1970 s , 0 .001  in  1970 , 0 .1  in mid  1970 s , 0 .001 {:[" in "1970","∼0^('').1],[" in mid "1970^(')s","∼0^('').001]:}\begin{aligned} & \text { in } 1970, \sim 0^{\prime \prime} .1 \\ & \text { in mid } 1970^{\prime} \mathrm{s}, \sim 0^{\prime \prime} .001 \end{aligned} in 1970,0.1 in mid 1970s,0.001
Distance between two bodies in solar system
(a) Perihelion shift per Earth year
(1) for Mercury, 120 km
(2) for Mars, 15 km
(b) Relativistic time delay for radio waves from Earth, past limb of sun, to Venus (one way),
1 × 10 4 sec = 30 km 1 × 10 4 sec = 30 km 1xx10^(-4)sec=30km1 \times 10^{-4} \mathrm{sec}=30 \mathrm{~km}1×104sec=30 km
(c) Periodic relativistic effects in Earth-moon separation
(1) in general relativity,
100 cm 100 cm 100cm100 \mathrm{~cm}100 cm
(2) in Jordan-Brans-Dicke theory, 100 cm ; ( 840 cm ) / ( 2 + ω ) 100 cm ; ( 840 cm ) / ( 2 + ω ) 100cm;quad(840cm)//(2+omega)100 \mathrm{~cm} ; \quad(840 \mathrm{~cm}) /(2+\omega)100 cm;(840 cm)/(2+ω)
(a) Separation of another planet (Mercury, Venus, Mars) from Earth, by bouncing radar signals off it,
0.3 km 0.3 km ∼0.3km\sim 0.3 \mathrm{~km}0.3 km
(b) Separation of a radio transponder (on another planet or in a space craft) from Earth, by measuring round-trip radio travel time,
3 × 10 8 sec = 10 m = 0.01 km 3 × 10 8 sec = 10 m = 0.01 km ∼3xx10^(-8)sec=10m=0.01km\sim 3 \times 10^{-8} \mathrm{sec}=10 \mathrm{~m}=0.01 \mathrm{~km}3×108sec=10 m=0.01 km
(c) Earth-moon separation by laser ranging, 10 cm 10 cm ∼10cm\sim 10 \mathrm{~cm}10 cm
Difference in lapse of proper time between two world lines in solar system
(a) Clock on Earth vs. clock in synchronous Earth orbit,
Δ t / t 6 × 10 10 Δ t / t 6 × 10 10 Delta t//t∼6xx10^(-10)\Delta t / t \sim 6 \times 10^{-10}Δt/t6×1010
(b) Clock on Earth vs. clock in orbit about sun,
Δ t / t 10 8 Δ t / t 10 8 Delta t//t∼10^(-8)\Delta t / t \sim 10^{-8}Δt/t108
Stability of a hydrogen maser clock,
Δ t / t 10 13 for t up to one year Δ t / t 10 13  for  t  up to   one year  {:[Delta t//t∼10^(-13)" for "t" up to "],[" one year "]:}\begin{gathered} \Delta t / t \sim 10^{-13} \text { for } t \text { up to } \\ \text { one year } \end{gathered}Δt/t1013 for t up to  one year 
craft; radar signals bounced off planets; radio beacons and transponders landed on planets or orbited about them; etc. Because of the expense, it is crucial to have as good a theoretical framework as possible for comparing the relative values of various experiments-and for proposing new ones, which might have been overlooked.
Such a framework must lie outside general relativity. It must scrutinize the foundations of Einstein's theory. It must compare Einstein's theory with other viable theories of gravity to see which experiments can distinguish between them. It must be a "theory of theories."
At present, in 1973, there are two different frameworks in broad use. One, devised largely by Dicke (1964b), ^(**){ }^{*} assumes almost nothing about the nature of gravity. It is used to design and discuss experiments for testing, at a very fundamental level, the nature of spacetime and gravity. Within it, one asks such questions as: Do all bodies respond to gravity with the same acceleration? Is space locally isotropic in its intrinsic properties? What are the theoretical implications of local isotropy? What types of fields, if any, are associated with gravity: scalar fields, vector fields, tensor fields, affine fields? Although some of the experiments that tackle these questions will be discussed below, this book will not attempt a detailed exposition of the Dicke framework.
The second framework in broad use is the "parametrized post-Newtonian (PPN) formalism." It has been developed to higher and higher levels of sophistication by Eddington (1922), Robertson (1962), Schiff (1962, 1967), Nordtvedt (1968b, 1969), Will (1971c), and Will and Nordtvedt (1972).
The PPN formalism is an approximation to general relativity, and also to a variety of other contemporary theories of gravity, called "metric theories." It is a good approximation whenever, as in the solar system, the sources of the field gravitate weakly ( | Φ | / c 2 1 ) | Φ | / c 2 1 (|Phi|//c^(2)≪1)\left(|\Phi| / c^{2} \ll 1\right)(|Φ|/c21) and move slowly ( v 2 / c 2 1 ) v 2 / c 2 1 (v^(2)//c^(2)≪1)\left(v^{2} / c^{2} \ll 1\right)(v2/c21). The PPN formalism contains a set of ten parameters whose values differ from one theory to another. Solar-system experiments (measurements of perihelion shift, light deflection, etc.) can be regarded as attempts to measure some of these PPN parameters, and thereby to determine which metric theory of gravity is correct-general relativity, Brans-Dicke (1961)Jordan (1959) theory, one of Bergmann's (1968) scalar-tensor theories, one of Nordstrøm's theories, Whitehead's (1922) theory, or something else. [For reviews of Nordstrøm and Whitehead, see Whitrow and Morduch (1965), Will (1971b), and Ni (1972). For a significant nonmetric theory, see Cartan (1920) and Trautman (1972).]
Chapter 39 will discuss the concept of a metric theory of gravity and will construct the PPN formalism; and then Chapter 40 will use the PPN formalism to analyze the systematics of the solar system, and to discuss a variety of past and future experiments that distinguish between various metric theories of gravity. But first, as a prelude to those topics, this chapter will examine experiments that test the foundations of general relativity-foundations on which most other metric theories also rest. For a more detailed discussion of most of these experiments, see Dicke (1964b).
Theoretical frameworks for analyzing gravitation experiments:
(1) Dicke framework
(2) PPN framework

The rest of this chapter is

Track 2.
No earlier Track-2 material is needed as preparation for it, but Chapter 7 (incompatibility of gravity and special relativity) will be helpful.
This chapter is not needed as preparation for any later chapter, but it will be helpful in Chapters 39 and 40 (other theories; PPN formalism; experimental tests).
Eötvös-Dicke experiment to test uniqueness of free fall

§38.3. TESTS OF THE PRINCIPLE OF THE UNIQUENESS OF FREE FALL: EÖTVÖS-DICKE EXPERIMENT

One fundamental building block common to Einstein's theory of gravity and to almost all other modern theories is the principle of "uniqueness of free fall":* "The world line of a freely falling test body is independent of its composition or structure." By "test body" is meant an electrically neutral body, small enough that (1) its self-gravitational energy, as calculated using standard Newtonian theory, can be neglected compared to its rest mass ( M / R 1 ) ( M / R 1 ) (M//R≪1)(M / R \ll 1)(M/R1), and (2) the coupling of its multipole moments to inhomogeneities of the gravitational field can be neglected. \dagger
The uniqueness of free fall permits one to regard spacetime as filled with a set of curves, the test-body trajectories, which are unique aside from parametrization. Through each event, along each timelike or null direction in spacetime, there passes one and only one test-body trajectory. Describe these trajectories mathematically: that is a central imperative of any theory of gravity.
When translated into Newtonian language, the uniqueness of free fall states that any two test bodies must fall with the same acceleration in a given external gravitational field. Experimental tests of this principle search for differences in acceleration from one body to another. The most precise experiments to date are of a type devised by Baron Lorand von Eötvös (Box 38.2), redesigned and pushed to much higher precision by the Princeton group of Robert H. Dicke (Box 38.3), and extended with modifications by the Moscow group of Vladimir B. Braginsky. (See Figure 1.6 and Box 1.2 for experimental details.)
These Eötvös-Dicke experiments are "null experiments." They balance the acceleration of one body against the acceleration of another, and look for tiny departures from equilibrium. The reason is simple. Null experiments typically have much higher precision than experiments measuring the value of a nonzero quantity.
Eötvös, Pekar, and Fekete (1922) checked to an accuracy of 5 parts in 10 9 10 9 10^(9)10^{9}109 that the Earth imparts the same acceleration to wood, platinum, copper, asbestos, water, magnalium ( 90 % Al , 10 % Mg 90 % Al , 10 % Mg 90%Al,10%Mg90 \% \mathrm{Al}, 10 \% \mathrm{Mg}90%Al,10%Mg ), copper sulphate, and tallow. Renner (1935) checked, to 7 parts in 10 10 10 10 10^(10)10^{10}1010, the Earth's acceleration of platinum, copper, bizmuth, brass, glass, ammonium fluoride, and an alloy of 30 % Mg , 70 % Cu 30 % Mg , 70 % Cu 30%Mg,70%Cu30 \% \mathrm{Mg}, 70 \% \mathrm{Cu}30%Mg,70%Cu. Dicke, and later Braginsky, chose to use the sun's gravitational acceleration rather than the Earth's, since the alternation in the direction of the sun's pull every 12 hours lends itself to amplification by resonance. (See Figure 1.6.) Roll, Krotkov, and Dicke (1964) reported an
agreement of 1 part in 10 11 10 11 10^(11)10^{11}1011 between the sun's acceleration of aluminum and gold, while Braginsky and Panov (1971) reported agreement to 1 part in 10 12 10 12 10^(12)10^{12}1012 for aluminum and platinum.
From this agreement, one can infer the response of neutrons, protons, electrons, virtual electron-positron pairs, nuclear binding energy, and electrostatic energy to the sun's gravity. Gold is 60 % 60 % 60%60 \%60% neutrons, while aluminum is only 50 % 50 % 50%50 \%50% neutrons. Therefore even from the 1964 results one could conclude that neutrons and protons must have the same acceleration to within [ 0.6 0.5 = 0.1 ] 1 [ 0.6 0.5 = 0.1 ] 1 [0.6-0.5=0.1]^(-1)[0.6-0.5=0.1]^{-1}[0.60.5=0.1]1 parts in 10 11 = 1 10 11 = 1 10^(11)=110^{11}=11011=1 part in 10 10 10 10 10^(10)10^{10}1010. Similarly, electrons must accelerate the same as nucleons to 2 parts in 10 7 10 7 10^(7)10^{7}107; virtual pairs (being more abundant in gold than in aluminum) must accelerate the same to 1 part in 10 4 10 4 10^(4)10^{4}104; nuclear binding energy, to 1 part in 10 7 10 7 10^(7)10^{7}107; and electrostatic energy to 3 parts in 10 9 10 9 10^(9)10^{9}109.
This accuracy of testing gives one confidence in the principle of the uniqueness of free fall.
(continued on page 1054)
Theoretical implications of Eötvös-Dicke experiment

Box 38.2 BARON LORAND VON EÖTVÖS

Budapest, July 27, 1848-Budapest, April 8, 1919
Eötvös (pronounced ut'vûsh) studied at Heide'herg with Kirchhoff, Helmholtz, and Bunsen and at Königsberg with Neumann and Richelot. His 1870 Heidelberg Ph.D. thesis dealt with an issue of relativity: can the motion of a light source relative to an "ether" be detected by comparing the light intensities in the direction of the motion and in the opposite direction?
Studies of his at the same time resulted in the Eötvös law of capillarity, (surface tension ) 2.12 ( T crit T ) / ( specific volume ) 2 / 3 ) 2.12 T crit  T / (  specific volume  ) 2 / 3 )~~2.12(T_("crit ")-T)//(" specific volume ")^(2//3)) \approx 2.12\left(T_{\text {crit }}-T\right) /(\text { specific volume })^{2 / 3})2.12(Tcrit T)/( specific volume )2/3. Eötvös, made professor of physics at Budapest in 1872, concentrated on gravity from 1886 onward. He developed and extended the original Michell-Cavendish torsion balance, which measured not only Φ , x x Φ , x x Phi_(,xx)\Phi_{, x x}Φ,xx and Φ , x y Φ , x y Phi_(,xy)\Phi_{, x y}Φ,xy (where Φ Φ Phi\PhiΦ is the gravitational potential) but also Φ , x z Φ , x z Phi_(,xz)\Phi_{, x z}Φ,xz and Φ , y z Φ , y z Phi_(,yz)\Phi_{, y z}Φ,yz, all to a precision destined to be unexcelled for decades. He showed that the so-called "ratio between gravitational mass and inertial mass" cannot vary from material to material by more than 5 parts in 10 9 10 9 10^(9)10^{9}109. He investigated the paleomagnetism of bricks and other ceramic objects, and studied the shape of the earth. He served (June 1894-January 1895 ) as minister of public instruction and religious affairs (a cabinet position held in earlier years by his father). He founded a school which trained high-school teachers, to whose leavening influence one can give some of the credit for such outstanding scientists as von Karman, von Neuman, Teller, and Wigner. He served one year as rector of the University of Budapest.
"I can never forget the moment when my train rushed into the railroad station of Heidelberg along the banks of the Neckar. . I cannot forget my happiness that now I could breathe the same air as those men of science whose fame attracted me there."
[EÖTVÖS IN 1887, AS QUOTED IN FEJÉR AND MIKOLA (1918), P. 259.]
Box 38.2 (continued)
Photograph by A. Szekely 1913
"Insofar as it is permitted on the basis of a few experiments, we can therefore declare that μ μ mu\muμ, that is, the weakening of the Earth's attraction through the intervening compensator quadrants, is less than one part in 5 × 10 10 5 × 10 10 5xx10^(10)dots5 \times 10^{10} \ldots5×1010 the absorption (of gravity) by the entire earth along a diameter is less than about one part in 800.
"We have carried out a series of observations which surpassed all previous ones in precision, but in no case could we discover any detectable deviation from the law of proportionality of gravitation and inertia."
[EÖTVÖS, PEKÁR, AND FEKETE (1922).]
"Science shall never find that formula by which its necessary character could be proved. Actually science itself might cease if we were to find the clue to the
secret."
[EÖTVÖS, PRESIDENTIAL ADDRESS TO THE HUNGARIAN ACADEMY OF SCIENCES, 1890,
AS QUOTED IN FEJÉR AND MIKOLA (1918), P. 280.]
"We should consider it as one of the most astonishing errors of the present age that so many people listen to the words of pseudoprophets who, in place of the dogmas of religion, offer scientific dogmas with medieval impatience but without historical justification."
[EÖTVÖS, 1877, AS QUOTED IN FEJÉR AND MIKOLA (1918), P. 280.]

Box 38.3 ROBERT HENRY DICKE May 6, 1916, St. Louis, Missouri Cyrus Fogg Brackett Professor of Physics at Princeton University

During 1941-1946, Dicke was a leader in replacing the outmoded concept of lumped circuit elements by a new microwave analysis based on symmetry considerations, conservation laws, reciprocity relations, and the scattering matrix-concepts that led, among others, to the lock-in amplifier and the microwave radiometer. Searching for a means to reduce the Doppler width of spectral lines for precision measurements, Dicke discovered recoilless radiation in atomic systems held in a box or in a buffer gas. This development led to (1) the discovery of the basic idea of the gas-cell atomic clock and (2) a much more precise measurement of the gyromagnetic ratio of electrons in the 1 s and 2 s levels of hydrogen and of the hyperfine structure of atomic hydrogen.
A fundamental paper by Dicke in 1954 set forth the theory of coherent radiation processes and of the superradiant state, and laid the foundation for the future development of the laser and the maser, to which he also contributed. His patent no. 2,851,652 (filed May 21, 1956) was the first disclosure of a device for the generation of infrared radiation by a coherent process, and supplied the first suggestion for combining the use of an etalon resonator with an amplifying gas.
Beginning in the 1960's, Dicke brought his talent for precision measurement to the service of experimental cosmology, and with his collaborators: (1)

checked the equivalence principle with the up-to-then unprecedented accuracy of 1 part in 10 11 10 11 10^(11)10^{11}1011; (2) determined the solar oblateness; and (3) suggested that the primordial cosmic-fireball radiation, a tool for seeing deeper into the past history of the universe than has ever before been possible, should be observable, and therefore should be hunted down and found.
"For want of a better term, a gas which is radiating strongly because of coherence will be called 'superradiant.' . . . As the system radiates it passes to states of lower m with r unchanged-to the 'superradiant' region m 0 m 0 m∼0\mathrm{m} \sim 0m0 "
(1954)
"Possibilities are examined for the excitation of optical 'superradiant' states of gas"
Box 38.3 (continued)
"A 'gravitational oblateness' of [the sun of] 5 × 10 5 5 × 10 5 5xx10^(-5)5 \times 10^{-5}5×105 would require the abandonment of Einstein's purely geometrical theory of gravitation. . Such a flattening [of the sun] could be understood as the effect of a rather rapidly rotating interior. . . The answer appears to be that in the past, and to this day, the solar corona with its magnetic field has acted as a brake on the surface of the sun"
(1964a)
"New measurements of the solar oblateness have given a value for the fractional difference of equatorial and polar radii of ( 5.0 ± 0.7 ) × 10 5 ( 5.0 ± 0.7 ) × 10 5 (5.0+-0.7)xx10^(-5)(5.0 \pm 0.7) \times 10^{-5}(5.0±0.7)×105 "
[DICKE AND GOLDENBERG (1967)]
'[The universe must] have aged sufficiently for there to exist elements other than hydrogen. It is well-known that carbon is required to make physicists"
(1961)
"The question of the constancy of such dimensionless numbers is to be settled not by definition but by measurements"
[BRANS AND DICKE (1961)]
"The geophysical data lead to an upper limit of 3 parts in 10 13 10 13 10^(13)10{ }^{13}1013 per year on the rate of change of the fine-structure constant"
[DICKE AND PEEBLES (1962)]
Experimental evidence for existence of a metric

§38.4. TESTS FOR THE EXISTENCE OF A METRIC GOVERNING LENGTH AND TIME MEASUREMENTS, AND PARTICLE KINEMATICS

Special relativity, general relativity, and all other metric theories of gravity assume the existence of a metric field and predict that this field determines the rates of ticking of atomic clocks and the lengths of laboratory rods by the familiar relation d τ 2 = d τ 2 = -dtau^(2)=-d \tau^{2}=dτ2= d s 2 = g α β d x α d x β d s 2 = g α β d x α d x β ds^(2)=g_(alpha beta)dx^(alpha)dx^(beta)d s^{2}=g_{\alpha \beta} d x^{\alpha} d x^{\beta}ds2=gαβdxαdxβ.
The experimental evidence for a metric comes largely from elementary particle physics. It is of two types: first, experiments that measure time intervals directly, e.g., measurements of the time dilation of the decay times of unstable particles;* second, experiments that reveal the fundamental role played by the Lorentz group in particle kinematics and elsewhere in particle physics. \dagger To cast out the metric tensor entirely would leave one with no theoretical framework adequate for interpreting such experiments.
Notice what particle-physics experiments do and do not tell one about the metric tensor, g g g\boldsymbol{g}g. First, they do not guarantee that there exist global Lorentz frames, i.e., coordinate systems extending throughout all of spacetime, in which g α β = η α β g α β = η α β g_(alpha beta)=eta_(alpha beta)g_{\alpha \beta}=\eta_{\alpha \beta}gαβ=ηαβ. However, they do suggest that at each event P P P\mathscr{P}P there exist orthonormal frames with e α ^ ( P ) e β ( P ) = η α β e α ^ ( P ) e β ( P ) = η α β e_( hat(alpha))(P)*e_(beta)(P)=eta_(alpha beta)\boldsymbol{e}_{\hat{\alpha}}(\mathscr{P}) \cdot \boldsymbol{e}_{\beta}(\mathscr{P})=\eta_{\alpha \beta}eα^(P)eβ(P)=ηαβ, which are related to each other by Lorentz transformations. These orthonormal frames provide one with a definition of the inner product between any two vectors at a given event-and, thereby, they define the metric field.
Second, particle experiments do not guarantee that freely falling particles move along geodesics of the metric field, i.e., along straight lines in local Lorentz frames. (Here, in $ $ 38.4 $ $ 38.4 $$38.4\$ \$ 38.4$$38.4 and 38.5 , the phrase "local Lorentz frame" means a "normal" coordinate system at an event P P P\mathscr{P}P, in which g α β ( P ) = η α β g α β ( P ) = η α β g_(alpha beta)(P)=eta_(alpha beta)g_{\alpha \beta}(\mathscr{P})=\eta_{\alpha \beta}gαβ(P)=ηαβ and g α β , γ ( P ) = 0 g α β , γ ( P ) = 0 g_(alpha beta,gamma)(P)=0g_{\alpha \beta, \gamma}(\mathscr{P})=0gαβ,γ(P)=0. The term "inertial frame" is avoided because no assertions are made, yet, about test-body motion.) In particular, one does not know from elementary-particle experiments whether the local Lorentz frames in the laboratory are freely falling (so they fly up from the center of the earth and then fall back with Newtonian acceleration g = 980 cm / sec 2 g = 980 cm / sec 2 g=980cm//sec^(2)g=980 \mathrm{~cm} / \mathrm{sec}^{2}g=980 cm/sec2 ), whether they are forever at rest relative to the laboratory walls, or whether they undergo some other type of motion. All one is led to believe is that a metric determines the nature of the spacetime intervals ( d τ 2 = g μ ν d x μ d x ν ) d τ 2 = g μ ν d x μ d x ν (dtau^(2)=-g_(mu nu)dx^(mu)dx^(nu))\left(d \tau^{2}=-g_{\mu \nu} d x^{\mu} d x^{\nu}\right)(dτ2=gμνdxμdxν) measured by atomic clocks, that the various local Lorentz frames in the laboratory therefore move with uniform velocity relative to each other (they are connected by Lorentz transformations), and that electric and magnetic fields and the energies and momenta of particles undergo Lorentz transformations in the passage from one local Lorentz frame to another.
Third, elementary particle experiments do suggest that the times measured by atomic clocks depend only on velocity, not on acceleration. The measured squared interval is d s 2 = g α β d x α d x β d s 2 = g α β d x α d x β ds^(2)=g_(alpha beta)dx^(alpha)dx^(beta)d s^{2}=g_{\alpha \beta} d x^{\alpha} d x^{\beta}ds2=gαβdxαdxβ, independently of acceleration (until the acceleration becomes so great it disturbs the structure of the clock; see § 16.4 § 16.4 §16.4\S 16.4§16.4 and Box 16.3). Equivalently, but more physically, the time interval measured by a clock moving with velocity v j v j v^(j)v^{j}vj relative to a local Lorentz frame is
(38.1) d τ = ( η α β d x α d x β ) 1 / 2 = [ 1 ( v x ) 2 ( v y ) 2 ( v z ) 2 ] 1 / 2 d t (38.1) d τ = η α β d x α d x β 1 / 2 = 1 v x 2 v y 2 v z 2 1 / 2 d t {:(38.1)d tau=(-eta_(alpha beta)dx^(alpha)dx^(beta))^(1//2)=[1-(v^(x))^(2)-(v^(y))^(2)-(v^(z))^(2)]^(1//2)dt:}\begin{equation*} d \tau=\left(-\eta_{\alpha \beta} d x^{\alpha} d x^{\beta}\right)^{1 / 2}=\left[1-\left(v^{x}\right)^{2}-\left(v^{y}\right)^{2}-\left(v^{z}\right)^{2}\right]^{1 / 2} d t \tag{38.1} \end{equation*}(38.1)dτ=(ηαβdxαdxβ)1/2=[1(vx)2(vy)2(vz)2]1/2dt
independently of the clock's acceleration d 2 x j / d t 2 d 2 x j / d t 2 d^(2)x^(j)//dt^(2)d^{2} x^{j} / d t^{2}d2xj/dt2. If this were not so, then particles moving in circular orbits in strong magnetic fields would exhibit different decay rates than freely moving particles-which they do not [Farley et al. (1966)].*

§38.5. TESTS OF GEODESIC MOTION: GRAVITATIONAL REDSHIFT EXPERIMENTS

The uniqueness of free fall, as tested by the Dicke-Eötvös experiments, implies that spacetime is filled with a family of preferred curves, the test-body trajectories. There
Particle experiments do not guarantee existence of global Lorentz frames, or geodesic motion for test particles
Particle experiments do suggest proper time is independent of acceleration
Physical meaning of a comparison between test-body trajectories and geodesics of metric
Pound-Rebka-Snider redshift experiment as a test of geodesic motion
is also another family of preferred curves, the geodesics of the metric g g g\boldsymbol{g}g. It is tempting to identify these geodesics with the test-body trajectories. Einstein's geometric theory of gravity makes this identification ("equivalence principle"). One might conceive of theories that reject this identification. What is the experimental evidence on this point?
In order to see what kinds of experiments are relevant, it is helpful to elucidate the physical significance of the geodesics.
A geodesic of g g g\boldsymbol{g}g is most readily identified locally by the fact that it is a straight line in the local Lorentz frames. Put differently, a body's motion is unaccelerated as measured in a local Lorentz frame if and only if the body moves along a geodesic of g g g\boldsymbol{g}g. Hence, to determine whether test-body trajectories are geodesics, one must compare experimentally the motion of the spatial origin of a local Lorentz frame (as defined by atomic-clock readings) with the motion of a test body (material particle).
It is easy to study experimentally the motions of test bodies; relative to an earthbound laboratory, they accelerate downward with g = 980 cm / sec 2 g = 980 cm / sec 2 g=980cm//sec^(2)g=980 \mathrm{~cm} / \mathrm{sec}^{2}g=980 cm/sec2; and this acceleration can be measured at a given location on the Earth to a precision of 1 part in 10 6 10 6 10^(6)10^{6}106.
Unfortunately, it is much more difficult to measure the motion of a local Lorentz frame, once again as defined by atomic-clock readings. The only direct experimental handle one has on this today, with sufficient precision to be interesting, is gravitational redshift experiments. (See §§7.2-7.5 and § 25.4 § 25.4 §25.4\S 25.4§25.4 for theoretical discussions of the gravitational redshift in the framework of general relativity.)
The redshift experiment of highest precision is that of Pound and Rebka (1960), as improved by Pound and Snider (1965); see Figure 38.1. It used the Mossbauer effect to measure the redshift of 14.4 keV gamma rays from Fe 57 Fe 57 Fe^(57)\mathrm{Fe}^{57}Fe57. The emitter and absorber of the gamma rays were placed at rest at the bottom and top of a tower at Harvard University, separated by a height h = 74 h = 74 h=74h=74h=74 feet = 22.5 = 22.5 =22.5=22.5=22.5 meters. The measured redshift agreed, to 1 percent precision, with the general relativistic prediction of
(38.2) Δ λ / λ = g h = 2.5 × 10 15 . (38.2) Δ λ / λ = g h = 2.5 × 10 15 . {:(38.2)Delta lambda//lambda=gh=2.5 xx10^(-15).:}\begin{equation*} \Delta \lambda / \lambda=g h=2.5 \times 10^{-15} . \tag{38.2} \end{equation*}(38.2)Δλ/λ=gh=2.5×1015.
This result tells one that the local Lorentz frames are not at rest relative to the Earth's surface; rather, they are accelerating downward with the same acceleration, g g ggg, as acts on a free particle (to within 1 percent precision). To arrive at this conclusion, one analyzes the experiment in the laboratory reference frame, where everything (the experimental apparatus, the Earth, the Earth's gravitational field) is static. Relative to the laboratory a local Lorentz frame, momentarily at rest, accelerates downward (horizontal accelerations being ruled out by symmetry) with some unknown acceleration a a aaa. Equivalently, the laboratory accelerates upward (in + z + z +z+z+z direction) with acceleration a a aaa relative to the local Lorentz frame. Consequently, the spacetime metric in the laboratory frame has the standard form
(38.3) d s 2 = ( 1 + 2 a z ) d t 2 + d x 2 + d y 2 + d z 2 + O ( | x j | 2 ) d x α d x β , (38.3) d s 2 = ( 1 + 2 a z ) d t 2 + d x 2 + d y 2 + d z 2 + O x j 2 d x α d x β , {:(38.3)ds^(2)=-(1+2az)dt^(2)+dx^(2)+dy^(2)+dz^(2)+O(|x^(j)|^(2))dx^(alpha)dx^(beta)",":}\begin{equation*} d s^{2}=-(1+2 a z) d t^{2}+d x^{2}+d y^{2}+d z^{2}+O\left(\left|x^{j}\right|^{2}\right) d x^{\alpha} d x^{\beta}, \tag{38.3} \end{equation*}(38.3)ds2=(1+2az)dt2+dx2+dy2+dz2+O(|xj|2)dxαdxβ,
Figure 38.1.
The experiment of Pound and Rebka (1959) and Pound and Snider (1965) on the gravitational redshift of photons rising 22.5 meters against gravity through a helium-filled tube in a shaft in the Jefferson Physical Laboratory of Harvard University. The source of Co 57 Co 57 Co^(57)\mathrm{Co}^{57}Co57 had an initial strength greater than a curie. The 14.4 keV gamma rays had to pass in through an absorber enriched in Fe 57 Fe 57 Fe^(57)\mathrm{Fe}^{57}Fe57 to reach the large-window proportional counters. Both source and absorber were placed in temperature-regulated ovens. The velocity of the source consisted of two parts: one steady ( v M ) v M (v_(M))\left(v_{M}\right)(vM), to put the center of the emission line on the part of the transmission curve that is nearly straight; and the other alternating between + v J + v J +v_(J)+v_{J}+vJ and v J v J -v_(J)-v_{J}vJ, to sweep the transmission curve in this straight region; similarly when the steady velocity was v M v M -v_(M)-v_{\boldsymbol{M}}vM. The departure from symmetry between the two cases + v M + v M +v_(M)+v_{M}+vM and v M v M -v_(M)-v_{M}vM allows one to determine the offset v D v D v_(D)v_{D}vD (effect of gravitational redshift) from the zero-gravity case of stationary emitter and stationary absorber. The final result for the redshift was ( 0.9990 ± 0.0076 ) ( 0.9990 ± 0.0076 ) (0.9990+-0.0076)(0.9990 \pm 0.0076)(0.9990±0.0076) times the value 4.905 × 10 15 4.905 × 10 15 4.905 xx10^(-15)4.905 \times 10^{-15}4.905×1015 of 2 gh / c 2 2 gh / c 2 2gh//c^(2)2 \mathrm{gh} / \mathrm{c}^{2}2gh/c2 predicted from the principle of equivalence (difference between "up" experiment and "down" experiment). Diagrams adapted from Pound and Snider (1965).
which Track-2 readers have met in $ § 6.6 $ § 6.6 $§6.6\$ \S 6.6$§6.6 and 13.6 ; and Track-1 readers have met and used in Box 16.2. Moreover, in the laboratory frame the metric is static, gravity is static, and the experimental apparatus is static. Therefore the crest of each electromagnetic wave that climbs upward must follow a world line t ( z ) t ( z ) t(z)t(z)t(z) identical in form to the world lines of the crests before and after it; thus,
wave crest \#0: t = t 0 ( z ) wave crest \#1: t = t 0 ( z ) + Δ t , wave crest \#n: t = t 0 ( z ) + n Δ t .  wave crest \#0:  t = t 0 ( z )  wave crest \#1:  t = t 0 ( z ) + Δ t  wave crest \#n:  t = t 0 ( z ) + n Δ t . {:[" wave crest \#0: "t=t_(0)(z)],[" wave crest \#1: "t=t_(0)(z)+Delta t", "],[vdots],[" wave crest \#n: "t=t_(0)(z)+n Delta t.]:}\begin{aligned} & \text { wave crest \#0: } t=t_{0}(z) \\ & \text { wave crest \#1: } t=t_{0}(z)+\Delta t \text {, } \\ & \vdots \\ & \text { wave crest \#n: } t=t_{0}(z)+n \Delta t . \end{aligned} wave crest \#0: t=t0(z) wave crest \#1: t=t0(z)+Δt wave crest \#n: t=t0(z)+nΔt.
[Here, as in Schild's argument ($7.3) that redshift implies spacetime curvature, no assumption is made about the form of the wave-crest world lines t 0 ( z ) t 0 ( z ) t_(0)(z)t_{0}(z)t0(z); see Figure 7.1.] Hence, expressed in coordinate time, the interval between reception of successive wave crests is the same as the interval between emission. Both are Δ t Δ t Delta t\Delta tΔt. But the atomic clocks of the experiment ( Fe 57 Fe 57 Fe^(57)\mathrm{Fe}^{57}Fe57 nuclei) are assumed to measure proper time Δ τ Δ τ Delta tau-=\Delta \tau \equivΔτ ( g α β Δ x α Δ x β ) 1 / 2 g α β Δ x α Δ x β 1 / 2 (-g_(alpha beta)Deltax^(alpha)Deltax^(beta))^(1//2)\left(-g_{\alpha \beta} \Delta x^{\alpha} \Delta x^{\beta}\right)^{1 / 2}(gαβΔxαΔxβ)1/2, not coordinate time. Thus
λ received λ emitted = Δ τ received Δ τ emitted = ( 1 + a z received ) Δ t ( 1 + a z emitted ) Δ t = 1 + a ( z received z emitted ; λ received  λ emitted  = Δ τ received  Δ τ emitted  = 1 + a z received  Δ t 1 + a z emitted  Δ t = 1 + a z received  z emitted  ; {:[(lambda_("received "))/(lambda_("emitted "))=(Deltatau_("received "))/(Deltatau_("emitted "))=((1+az_("received "))Delta t)/((1+az_("emitted "))Delta t)],[=1+a(z_("received ")-z_("emitted ");:}]:}\begin{aligned} \frac{\lambda_{\text {received }}}{\lambda_{\text {emitted }}} & =\frac{\Delta \tau_{\text {received }}}{\Delta \tau_{\text {emitted }}}=\frac{\left(1+a z_{\text {received }}\right) \Delta t}{\left(1+a z_{\text {emitted }}\right) \Delta t} \\ & =1+a\left(z_{\text {received }}-z_{\text {emitted }} ;\right. \end{aligned}λreceived λemitted =Δτreceived Δτemitted =(1+azreceived )Δt(1+azemitted )Δt=1+a(zreceived zemitted ;
i.e.,
(38.4) Δ λ λ = a h [ theoretical prediction based on assumptions (i) that atomic clocks measure Δ τ = ( g α β Δ x α Δ x β ) 1 / 2 ; (ii) that electromagnetic radiation has the form of a wave train; (iii) that local Lorentz frames accelerate downward with acceleration a relative to the laboratory. ] (38.4) Δ λ λ = a h  theoretical prediction based on assumptions   (i) that atomic clocks measure  Δ τ = g α β Δ x α Δ x β 1 / 2 ;  (ii) that electromagnetic radiation has the form of a   wave train;   (iii) that local Lorentz frames accelerate downward   with acceleration  a  relative to the laboratory.  {:(38.4)(Delta lambda)/(lambda)=ah[[" theoretical prediction based on assumptions "],[" (i) that atomic clocks measure "Delta tau=(-g_(alpha beta)Deltax^(alpha)Deltax^(beta))^(1//2);],[" (ii) that electromagnetic radiation has the form of a "],[" wave train; "],[" (iii) that local Lorentz frames accelerate downward "],[" with acceleration "a" relative to the laboratory. "]]:}\frac{\Delta \lambda}{\lambda}=a h\left[\begin{array}{l} \text { theoretical prediction based on assumptions } \tag{38.4}\\ \text { (i) that atomic clocks measure } \Delta \tau=\left(-g_{\alpha \beta} \Delta x^{\alpha} \Delta x^{\beta}\right)^{1 / 2} ; \\ \text { (ii) that electromagnetic radiation has the form of a } \\ \text { wave train; } \\ \text { (iii) that local Lorentz frames accelerate downward } \\ \text { with acceleration } a \text { relative to the laboratory. } \end{array}\right](38.4)Δλλ=ah[ theoretical prediction based on assumptions  (i) that atomic clocks measure Δτ=(gαβΔxαΔxβ)1/2; (ii) that electromagnetic radiation has the form of a  wave train;  (iii) that local Lorentz frames accelerate downward  with acceleration a relative to the laboratory. ]
Direct comparison with the experimental result (38.2) reveals that local Lorentz frames in an Earthbound laboratory accelerate downward with the same acceleration g g ggg as acts on a test particle (to within 1 per cent precision).
[The above discussion is basically a reworked version of Schild's proof ($7.2) that the redshift experiment implies spacetime is curved. After all, how could spacetime possibly be flat if Lorentz frames in Washington, Moscow, and Peking all accelerate toward the Earth's center with g = 980 cm / sec 2 g = 980 cm / sec 2 g=980cm//sec^(2)g=980 \mathrm{~cm} / \mathrm{sec}^{2}g=980 cm/sec2 ?]
Other redshift experiments
Of all redshift experiments, the Pound-Rebka-Snider experiment is the easiest to interpret theoretically, because it was performed in a uniform gravitational field. Complementary to it is the experiment by Brault (1962), which measured the redshift of the sodium D 1 D 1 D_(1)\mathrm{D}_{1}D1 line emitted on the surface of the sun and received at Earth (Figure 38.2). To a precision of 5 per cent, he found a redshift of G M / R c 2 G M / R c 2 GM_(o.)//R_(o.)c^(2)G M_{\odot} / R_{\odot} c^{2}GM/Rc2, where M M M_(o.)M_{\odot}M and R R R_(o.)R_{\odot}R are the mass and radius of the sun. This is just the redshift to be expected if
Figure 38.2.
The measurement by Brault (1962) of the redshift of the D 1 D 1 D_(1)\mathrm{D}_{1}D1 line of sodium gives 1.05 ± 0.05 1.05 ± 0.05 1.05+-0.051.05 \pm 0.051.05±0.05 of the gravitational redshift predicted by general relativity. This strong line, in contrast to the weak lines used by earlier investigators (1) is emitted high in the sun's atmosphere, above the regions strongly disturbed by the pressure and convective shifts, and yet lower than the chromosphere, and (2) comes closer to standing up cleanly above the background than any other line in the visible spectrum. Brault built a new photoelectric spectrometer (upper diagram), with its slit vibrated mechanically back and forth across a narrow region of the spectrum, to define the position of the line peak (1) electronically, (2) independently of subjective judgment, and (3) with a precision greater by a factor of the order of ten than that afforded by conventional visual methods. The slit is considered set on a line when its mean position is such that the photomultiplier current contains no signal at the frequency of the modulation. The redshift measured in this way is corrected for orbital motion and for rotation of the sun and the Earth to give the points in circles and triangles in the lower diagram. Extrapolation to zero vibration of the slit gives the cited number for the redshift. Figure adapted from thesis of Brault (1962).
the local Lorentz frames, at each point along the photon trajectory, fall in step with freely falling test bodies.*
In summary, redshift experiments reveal that, to a precision of several percent, the local Lorentz frames at the Earth's surface and near the sun are unaccelerated relative to freely falling test bodies. Equivalently, test bodies move along straight lines in the local Lorentz frames. Equivalently, the test-body trajectories are geodesics of the metric g g g\boldsymbol{g}g.

§38.6. TESTS OF THE EQUIVALENCE PRINCIPLE

Of all the principles at work in gravitation, none is more central than the equivalence principle. As enunciated in § 16.2 § 16.2 §16.2\S 16.2§16.2, it states: "In any and every local Lorentz frame, anywhere and anytime in the universe, all the (nongravitational) laws of physics must take on their familiar special-relativistic forms."
That test bodies move along straight lines in local Lorentz frames (geodesic motion) is one aspect of the equivalence principle. Other aspects are the universality of Maxwell's equations
(38.5) F α β , β = 4 π J α and F α β , γ + F β γ , α + F γ α , β = 0 (38.5) F α β , β = 4 π J α  and  F α β , γ + F β γ , α + F γ α , β = 0 {:(38.5)F^(alpha beta)_(,beta)=4piJ^(alpha)" and "F_(alpha beta,gamma)+F_(beta gamma,alpha)+F_(gamma alpha,beta)=0:}\begin{equation*} F^{\alpha \beta}{ }_{, \beta}=4 \pi J^{\alpha} \text { and } F_{\alpha \beta, \gamma}+F_{\beta \gamma, \alpha}+F_{\gamma \alpha, \beta}=0 \tag{38.5} \end{equation*}(38.5)Fαβ,β=4πJα and Fαβ,γ+Fβγ,α+Fγα,β=0
in all local Lorentz frames; the universality of the law of local energy-momentum conservation
(38.6) T α β , β = 0 ; (38.6) T α β , β = 0 ; {:(38.6)T^(alpha beta)_(,beta)=0;:}\begin{equation*} T^{\alpha \beta}{ }_{, \beta}=0 ; \tag{38.6} \end{equation*}(38.6)Tαβ,β=0;
and the universality of the values of the dimensionless constants that enter into the local laws of physics:
α e e 2 c = 1 137.0360 = ( electromagnetic fine- structure constant ) (38.7) m neutron m proton = 1.00138 , m electron m proton = 1 1836.12 , etc. α e e 2 c = 1 137.0360 = (  electromagnetic fine-   structure constant  ) (38.7) m neutron  m proton  = 1.00138 , m electron  m proton  = 1 1836.12 ,  etc.  {:[alpha_(e)-=(e^(2))/(ℏc)=(1)/(137.0360 dots)=((" electromagnetic fine- ")/(" structure constant "))],[(38.7)(m_("neutron "))/(m_("proton "))=1.00138 dots","quad(m_("electron "))/(m_("proton "))=(1)/(1836.12 dots)","quad" etc. "]:}\begin{gather*} \alpha_{e} \equiv \frac{e^{2}}{\hbar c}=\frac{1}{137.0360 \ldots}=\binom{\text { electromagnetic fine- }}{\text { structure constant }} \\ \frac{m_{\text {neutron }}}{m_{\text {proton }}}=1.00138 \ldots, \quad \frac{m_{\text {electron }}}{m_{\text {proton }}}=\frac{1}{1836.12 \ldots}, \quad \text { etc. } \tag{38.7} \end{gather*}αee2c=1137.0360=( electromagnetic fine-  structure constant )(38.7)mneutron mproton =1.00138,melectron mproton =11836.12, etc. 
(Attention here is confined to dimensionless constants, since only they are independent of one's arbitrary choice of units of measure.)
If one focuses attention on a given event and asks about invariance of the form of the physical laws [equations (38.5), (38.6), etc.] from one Lorentz frame to another, one is then in the province of special relativity. Here a multitude of experiments verify the equivalence principle (see § 38.4 § 38.4 §38.4\S 38.4§38.4 ).
If one asks about variations in the form of the laws from one event to another, one opens up a Pandora's box of possibilities that one hardly dares to contemplate. However, no experimental evidence has ever given the slightest warrant to consider any such "departure from democracy" in the action of the laws of physics. Moreover, astronomical observations provide strong evidence that the laws of physics are the
same in distant stellar systems as in the solar system, and the same in distant galaxies as in our own Galaxy. (See, in Box 29.5, Edwin Hubble's expressions of joy upon discovering this.)
Constancy of the dimensionless "constants" from event to event can be tested to high precision, if one assumes constancy of the physical laws. Dirac (1937, 1938), Teller (1948), Jordan ( 1955 , 1959 ) ( 1955 , 1959 ) (1955,1959)(1955,1959)(1955,1959), Gamow (1967), and others have proposed that the fine-structure "constant" α e α e alpha_(e)\alpha_{e}αe might be a slowly varying scalar field, perhaps governed by a cosmological equation. However, rather stringent limits on such variations follow from data on the fine-structure splitting of the spectral lines of quasars and radio galaxies. For the quasar 3C 191 with redshift z = 1.95 z = 1.95 z=1.95z=1.95z=1.95, Bahcall, Sargent, and Schmidt (1967) find α e ( 3 C 191 ) / α e ( α e ( 3 C 191 ) / α e ( alpha_(e)(3C191)//alpha_(e)(\alpha_{e}(3 \mathrm{C} 191) / \alpha_{e}(αe(3C191)/αe( Earth ) = 0.97 ± 0.5 ) = 0.97 ± 0.5 )=0.97+-0.5)=0.97 \pm 0.5)=0.97±0.5. With a cosmological interpretation of the quasar redshift, this corresponds to a limit ( 1 / α e ) ( d α e / d t ) 1 / 10 11 1 / α e d α e / d t 1 / 10 11 (1//alpha_(e))(dalpha_(e)//dt) <= 1//10^(11)\left(1 / \alpha_{e}\right)\left(d \alpha_{e} / d t\right) \leq 1 / 10^{11}(1/αe)(dαe/dt)1/1011 years. An even tighter limit has been obtained from radiogalaxy data, where there is no question about the interpretation of the redshift. Bahcall and Schmidt (1967) measured fine-structure splitting in five radio galaxies with z 0.20 z 0.20 z~~0.20z \approx 0.20z0.20, corresponding to an emission of light 2 × 10 9 2 × 10 9 2xx10^(9)2 \times 10^{9}2×109 years ago. They obtained α e ( z = 0.20 ) / α e ( α e ( z = 0.20 ) / α e ( alpha_(e)(z=0.20)//alpha_(e)(\alpha_{e}(z=0.20) / \alpha_{e}(αe(z=0.20)/αe( Earth ) = 1.001 ± 0.002 ) = 1.001 ± 0.002 )=1.001+-0.002)=1.001 \pm 0.002)=1.001±0.002, which yields the limit | ( 1 / α e ) ( d α e / d t ) | 1 / 10 12 1 / α e d α e / d t 1 / 10 12 |(1//alpha_(e))(dalpha_(e)//dt)| <= 1//10^(12)\left|\left(1 / \alpha_{e}\right)\left(d \alpha_{e} / d t\right)\right| \leq 1 / 10^{12}|(1/αe)(dαe/dt)|1/1012 years.
Dyson (1972) points out that comparison of the rate of beta decay of Re 187 Re 187 Re^(187)\mathrm{Re}^{187}Re187 in times past (via osmium-rhenium abundance ratios in old ores) with the rate of beta-decay today provides a means to check on any possible variation of α e α e alpha_(e)\alpha_{e}αe with time more sensitive than redshift data and more sensitive than any changes in rates of alpha decay and fission between early times and now. He summarizes the available data on Re 187 Re 187 Re^(187)\mathrm{Re}^{187}Re187 and arrives at the limit
| ( 1 / α e ) ( d α e / d t ) | 5 / 10 15 years. 1 / α e d α e / d t 5 / 10 15  years.  |(1//alpha_(e))(dalpha_(e)//dt)| <= 5//10^(15)" years. "\left|\left(1 / \alpha_{e}\right)\left(d \alpha_{e} / d t\right)\right| \leq 5 / 10^{15} \text { years. }|(1/αe)(dαe/dt)|5/1015 years. 
For further evidence of the constancy of the fundamental constants see Minkowski and Wilson (1956), Dicke (1959a,b), Dicke and Peebles (1962b).
Spatial variations of α e , m neutron / m proton α e , m neutron  / m proton  alpha_(e),m_("neutron ")//m_("proton ")\alpha_{e}, m_{\text {neutron }} / m_{\text {proton }}αe,mneutron /mproton , and other "constants" in the solar system can be sought by means of Eötvös-type experiments. The reasoning [by Dicke (1969)] leading from such experiments to limits on any spatial variation of the constants is indirect. It recalls the reasoning used in standard treatises on polar molecules to deduce the acceleration of a polarizable molecule pulled on by an inhomogeneous electric field. It proceeds as follows.
Suppose one of the dimensionless "constants," " α α alpha\alphaα," depends on position. This will lead to a position-dependence of the total mass-energy of a laboratory test body. For example, if α e α e alpha_(e)\alpha_{e}αe depends on position, then the coulomb energy of an atomic nucleus will also ( E coul e 4 α e 2 ; δ M / E coul = 2 δ α e / α e E coul  e 4 α e 2 ; δ M / E coul  = 2 δ α e / α e E_("coul ")prope^(4)propalpha_(e)^(2);delta M//E_("coul ")=2deltaalpha_(e)//alpha_(e)E_{\text {coul }} \propto e^{4} \propto \alpha_{e}{ }^{2} ; \delta M / E_{\text {coul }}=2 \delta \alpha_{e} / \alpha_{e}Ecoul e4αe2;δM/Ecoul =2δαe/αe ). One can calculate the change in a test body's mass-energy when it is moved from x μ x μ x^(mu)x^{\mu}xμ to x μ + δ x μ x μ + δ x μ x^(mu)+deltax^(mu)x^{\mu}+\delta x^{\mu}xμ+δxμ by assuming no change at all in the body's structure during its displacement:
(38.8) δ M = ( M / α ) fixed structure ( α / x μ ) δ x μ . (38.8) δ M = ( M / α ) fixed structure  α / x μ δ x μ . {:(38.8)delta M=(del M//del alpha)_("fixed structure ")(del alpha//delx^(mu))deltax^(mu).:}\begin{equation*} \delta M=(\partial M / \partial \alpha)_{\text {fixed structure }}\left(\partial \alpha / \partial x^{\mu}\right) \delta x^{\mu} . \tag{38.8} \end{equation*}(38.8)δM=(M/α)fixed structure (α/xμ)δxμ.
After the displacement, a weakening of internal forces (due, e.g., to a decrease of α α alpha\alphaα )
Eötvös-type experiments as tests for spatial variation of fundamental constants
may cause a change in structure, but that change will be accompanied by a conversion of internal potential energy into internal kinetic energy, which conserves M M MMM.
Now consider the following thought experiment [an elaboration of the argument by which Einstein first derived the gravitational redshift (§7.2)]: Take n n nnn particles, each with mass-energy μ μ mu\muμ. Make the particles with a structure such that a negligible fraction of μ μ mu\muμ is associated with the "constant" of interest, α α alpha\alphaα :
(38.9) ( 1 / μ ) ( μ / α ) = 0 . (38.9) ( 1 / μ ) ( μ / α ) = 0 . {:(38.9)(1//mu)(del mu//del alpha)=0.:}\begin{equation*} (1 / \mu)(\partial \mu / \partial \alpha)=0 . \tag{38.9} \end{equation*}(38.9)(1/μ)(μ/α)=0.
Place these particles at a height h h hhh in a (locally) uniform Newtonian field. Combine them together there, releasing binding energy E B ( h ) E B ( h ) E_(B)(h)E_{B}(h)EB(h), to form a composite body of mass
(38.10) M = n μ E B ( h ) (38.10) M = n μ E B ( h ) {:(38.10)M=n mu-E_(B)(h):}\begin{equation*} M=n \mu-E_{B}(h) \tag{38.10} \end{equation*}(38.10)M=nμEB(h)
which depends in a significant manner on the "constant" α α alpha\alphaα,
(38.11) ( 1 / M ) ( M / α ) 0 (38.11) ( 1 / M ) ( M / α ) 0 {:(38.11)(1//M)(del M//del alpha)!=0:}\begin{equation*} (1 / M)(\partial M / \partial \alpha) \neq 0 \tag{38.11} \end{equation*}(38.11)(1/M)(M/α)0
Lower this body, and the released binding energy tied up in a little bag, a distance δ h δ h delta h\delta hδh. The total force acting is (in Newtonian language)
(38.12) F = M a + E B ( h ) g . (38.12) F = M a + E B ( h ) g . {:(38.12)F=Ma+E_(B)(h)g.:}\begin{equation*} F=M a+E_{B}(h) g . \tag{38.12} \end{equation*}(38.12)F=Ma+EB(h)g.
Here g g ggg is acceleration experienced by the type of mass-energy that is independent of α α alpha\alphaα when it is in free fall. In contrast, "free" fall of the assembled body M M MMM is not really free fall, because of the supplementary "polarization force" pulling on this object. Hence the assembled body in "free" fall experiences an acceleration, a a aaa, a little different from g g ggg. However, the mass that is accelerated is precisely M M MMM, and therefore the force required to produce this acceleration is given by the product Ma. The energy gained in lowering the body and the bag is
E ( down ) = F δ h = M a δ h + E B ( h ) g δ h . E (  down  ) = F δ h = M a δ h + E B ( h ) g δ h . E(" down ")=F delta h=Ma delta h+E_(B)(h)g delta h.E(\text { down })=F \delta h=M a \delta h+E_{B}(h) g \delta h .E( down )=Fδh=Maδh+EB(h)gδh.
Put this energy in the bag.
At h δ h h δ h h-delta hh-\delta hhδh use some of the energy from the bag to pull the body apart into its component particles. The energy required is
E B ( h δ h ) = n μ M ( h δ h ) = n μ M ( h ) + M α d α d h δ h = E B ( h ) + M α α h δ h ; E B ( h δ h ) = n μ M ( h δ h ) = n μ M ( h ) + M α d α d h δ h = E B ( h ) + M α α h δ h ; {:[E_(B)(h-delta h)=n mu-M(h-delta h)=n mu-M(h)+(del M)/(del alpha)(d alpha)/(dh)delta h],[=E_(B)(h)+(del M)/(del alpha)(del alpha)/(del h)delta h;]:}\begin{aligned} E_{B}(h-\delta h) & =n \mu-M(h-\delta h)=n \mu-M(h)+\frac{\partial M}{\partial \alpha} \frac{d \alpha}{d h} \delta h \\ & =E_{B}(h)+\frac{\partial M}{\partial \alpha} \frac{\partial \alpha}{\partial h} \delta h ; \end{aligned}EB(hδh)=nμM(hδh)=nμM(h)+Mαdαdhδh=EB(h)+Mααhδh;
so an energy
E bag = E B ( h ) + E ( down ) E B ( h δ h ) (38.13) = [ M a + E B ( h ) g M α d α d h ] δ h E bag = E B ( h ) + E (  down  ) E B ( h δ h ) (38.13) = M a + E B ( h ) g M α d α d h δ h {:[E_(bag)=E_(B)(h)+E(" down ")-E_(B)(h-delta h)],[(38.13)=[Ma+E_(B)(h)g-(del M)/(del alpha)(d alpha)/(dh)]delta h]:}\begin{align*} E_{\mathrm{bag}} & =E_{B}(h)+E(\text { down })-E_{B}(h-\delta h) \\ & =\left[M a+E_{B}(h) g-\frac{\partial M}{\partial \alpha} \frac{d \alpha}{d h}\right] \delta h \tag{38.13} \end{align*}Ebag=EB(h)+E( down )EB(hδh)(38.13)=[Ma+EB(h)gMαdαdh]δh
is left in the bag. Use this energy to raise the n n nnn particles and the bag back up to
height h h hhh. Assume total energy conservation, so that there will be no extra energy and no deficit when the n n nnn particles and bag have returned to the original state back at height h h hhh. This means that E bag E bag  E_("bag ")E_{\text {bag }}Ebag  must be precisely the right amount of energy to do the raising:
(38.14) E bag = n μ g δ h = [ M + E B ( h ) ] g δ h . (38.14) E bag  = n μ g δ h = M + E B ( h ) g δ h . {:(38.14)E_("bag ")=n mu g delta h=[M+E_(B)(h)]g delta h.:}\begin{equation*} E_{\text {bag }}=n \mu g \delta h=\left[M+E_{B}(h)\right] g \delta h . \tag{38.14} \end{equation*}(38.14)Ebag =nμgδh=[M+EB(h)]gδh.
Combining expressions (38.13) and (38.14) for E bag E bag  E_("bag ")E_{\text {bag }}Ebag , discover that
(38.15) a g = 1 M M α d α d h . (38.15) a g = 1 M M α d α d h . {:(38.15)a-g=(1)/(M)(del M)/(del alpha)(d alpha)/(dh).:}\begin{equation*} a-g=\frac{1}{M} \frac{\partial M}{\partial \alpha} \frac{d \alpha}{d h} . \tag{38.15} \end{equation*}(38.15)ag=1MMαdαdh.
Thus, under the assumption of total energy conservation (no perpetual-motion machines!), a spatial dependence of a physical "constant" α α alpha\alphaα will lead to the anomaly (38.15) in the acceleration of a body whose mass depends on α α alpha\alphaα.
Coulomb energy, which is proportional to α e 2 α e 2 alpha_(e)^(2)\alpha_{e}{ }^{2}αe2, amounts in a gold nucleus to 0.4 per cent of the mass, and to 0.1 per cent in an aluminum nucleus. Hence, a spatial variation in α e α e alpha_(e)\alpha_{e}αe should lead to a fractional difference in the gravitational accelerations of these two nuclei equal to
| a A u a A l g | 1 g 2 0.003 α e d α e d h 1 × 10 11 a A u a A l g 1 g 2 0.003 α e d α e d h 1 × 10 11 |(a_(Au)-a_(Al))/(g)|~~(1)/(g)2(0.003)/(alpha_(e))(dalpha_(e))/(dh) <= 1xx10^(-11)\left|\frac{a_{A u}-a_{A l}}{g}\right| \approx \frac{1}{g} 2 \frac{0.003}{\alpha_{e}} \frac{d \alpha_{e}}{d h} \leqq 1 \times 10^{-11}|aAuaAlg|1g20.003αedαedh1×1011
i.e.,
1 α e | d α e d h | 1 × 10 9 g 1 × 10 9 cm / sec 2 = 1 × 10 30 / cm 1 α e d α e d h 1 × 10 9 g 1 × 10 9 cm / sec 2 = 1 × 10 30 / cm (1)/(alpha_(e))|(dalpha_(e))/(dh)|≲1xx10^(-9)g~~1xx10^(-9)cm//sec^(2)=1xx10^(-30)//cm\frac{1}{\alpha_{e}}\left|\frac{d \alpha_{e}}{d h}\right| \lesssim 1 \times 10^{-9} g \approx 1 \times 10^{-9} \mathrm{~cm} / \mathrm{sec}^{2}=1 \times 10^{-30} / \mathrm{cm}1αe|dαedh|1×109g1×109 cm/sec2=1×1030/cm
at the Earth due to the sun.
Here use is made of the limit ( 1 × 10 11 1 × 10 11 1xx10^(-11)1 \times 10^{-11}1×1011 ) from Dicke's experiment ( $ 38.3 $ 38.3 $38.3\$ 38.3$38.3 ), and the acceleration g = 0.6 cm / sec 2 g = 0.6 cm / sec 2 g=0.6cm//sec^(2)g=0.6 \mathrm{~cm} / \mathrm{sec}^{2}g=0.6 cm/sec2 due to the sun at Earth.
Notice that this says the gradient of ln α e ln α e ln alpha_(e)\ln \alpha_{e}lnαe is less than 1 × 10 9 1 × 10 9 1xx10^(-9)1 \times 10^{-9}1×109 the gradient of the Newtonian potential!

§38.7. TESTS FOR THE EXISTENCE OF UNKNOWN LONG-RANGE FIELDS

Whether or not one accepts the assumption that test bodies move on geodesics of the metric, it remains conceivable that previously unknown long-range fields (fields with " 1 / r 1 / r 1//r1 / r1/r " fall-off at large distances) are somehow associated with gravity.
If "new" long-range fields (not metric, not electromagnetic) do exist, waiting to be discovered, then there are two ways by which they could influence matter. First, they could couple directly to matter, producing, for example, slight deviations from
Possible existence of new long-range fields associated with gravity
Direct vs. indirect coupling geodesic motion (deviations smaller than the limits of §38.5), or slight dependences of masses of particles on position (dependences smaller than the limits of § 38.6 § 38.6 §38.6\S 38.6§38.6 ). Second (and harder to detect), they could couple indirectly to matter by being mere
Experimental limits on direct-coupling fields:
(1) Hughes-Drever experiment
(2) ether-drift experiments
participants in field equations that determine the geometry of spacetime. This section will describe tests for direct-coupling effects. Theories with fields that couple indirectly will be described in Box 39.1 , and tests for such fields will be discussed in Chapter 40.
Dicke (1964b), using his framework for analyzing tests of gravitation theories (§38.2), has shown that several null experiments place stringent limits on unknown, direct-coupling, long-range fields.
One of these experiments is the "Hughes-Drever Experiment" [Hughes, Robinson, and Beltran-Lopez (1960); Drever (1961)]. It can be thought of as a search for a symmetric second-rank tensor field h α β h α β h_(alpha beta)h_{\alpha \beta}hαβ that produces slight deviations of test-body trajectories from geodesics of the metric g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ. Unless one's experiments happen to be made in a region of spacetime where h α β h α β h_(alpha beta)h_{\alpha \beta}hαβ is a constant multiple of g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ ("mere rescaling of all lengths and times by a constant factor"), this tensor field must produce anisotropies in the properties of spacetime-which, in turn, will cause anisotropies in the inertial mass of a nucleon, and in turn will cause in an atomic nucleus relative shifts of degenerate energy levels with different magnetic quantum numbers. The Hughes-Drever experiment places stringent limits on such shifts, and thereby on a possible tensor field h α β h α β h_(alpha beta)h_{\alpha \beta}hαβ. To quote Dicke (1964, p. 186), "If two [tensor] fields are present with the one strongly anisotropic in a coordinate system chosen to make the other isotropic, the strength of [direct] coupling to one must be only of the order of 10 22 10 22 10^(-22)10^{-22}1022 that of the other.... [Moreover], on the moving Earth with ever-changing velocity, anisotropy would be expected at some season." From the experiments of Hughes and Drever, then, one concludes that there is not the slightest evidence for the presence of a second tensor field. For further details see Dicke and Peebles (1962a).
Another series of experiments, called "ether-drift experiments," places stringent limits on any unknown, long-range vector field that couples directly to mass-energy. One can imagine such a field of cosmological origin. Being cosmological, the 4-vector would most naturally be expected to point in the same direction as the 4 -vector u u u\boldsymbol{u}u of the "cosmological fluid" (identical with the time direction e 0 e 0 e_(0)\boldsymbol{e}_{0}e0 of a frame in which the cosmic microwave radiation is isotropic). The 4 -vector of the new field would then have spatial components in any other frame. In principle an observer could use them to discern his direction of motion and speed relative to the mean rest frame of the universe. The ether-drift experiments search for effects of such a field.
For example, the experiment of Turner and Hill (1964) searches for a dependence of clock rates on such a vector field, by examining the transverse Doppler shift as a function of direction for an emitter on the rim of a centrifuge and a receiver at its center (Figure 38.3). If there is any effect, it would most naturally be expected to have the form
(38.17) ( rate of clock moving relative to universe with speed β ) ( rate of clock at rest relative to universe ) = 1 + γ β 2 , γ a small constant. (38.17) (  rate of clock moving relative   to universe with speed  β ) (  rate of clock at rest   relative to universe  ) = 1 + γ β 2 , γ  a small constant.  {:(38.17)(((" rate of clock moving relative ")/(" to universe with speed "beta)))/(((" rate of clock at rest ")/(" relative to universe ")))=1+gammabeta^(2)","quad gamma" a small constant. ":}\begin{equation*} \frac{\binom{\text { rate of clock moving relative }}{\text { to universe with speed } \boldsymbol{\beta}}}{\binom{\text { rate of clock at rest }}{\text { relative to universe }}}=1+\gamma \beta^{2}, \quad \gamma \text { a small constant. } \tag{38.17} \end{equation*}(38.17)( rate of clock moving relative  to universe with speed β)( rate of clock at rest  relative to universe )=1+γβ2,γ a small constant. 
Figure 38.3.
The experiment of Turner and Hill (1964) looks for a dependence of proper clock rate (the clock being a Co 57 Co 57 Co^(57)\mathrm{Co}^{57}Co57 source placed near the rim of the centrifuge) on velocity relative to the distant matter of the universe; or, in operational terms, relative to a "new local field" described by a 4 -vector. The 14.4 keV gamma rays from the C 57 C 57 C^(57)\mathrm{C}^{57}C57 already experience a second-order Doppler shift of 1.3 parts in 10 13 10 13 10^(13)10^{13}1013. One searches for an additional shift γ β 2 γ β 2 gammabeta^(2)\gamma \beta^{2}γβ2 where β = u + v ( e x cos ω t + β = u + v e x cos ω t + beta=u+v(e_(x)cos omega t+:}\boldsymbol{\beta}=\boldsymbol{u}+v\left(\boldsymbol{e}_{x} \cos \omega t+\right.β=u+v(excosωt+ e y sin ω t e y sin ω t e_(y)sin omega t\boldsymbol{e}_{y} \sin \omega teysinωt ) is the velocity relative to the frame in which the scalar field is purely timelike. The transmission of the gamma rays through the Fe 57 Fe 57 Fe^(57)\mathrm{Fe}^{57}Fe57 absorber will drop linearly with any such additional shift, and will be noted as a drop in the counting rate of the NaI crystal. The source was 10 cm from the axis of rotation and the centrifuge turned at 15 , 000 rpm 15 , 000 rpm 15,000rpm15,000 \mathrm{rpm}15,000rpm. The value of γ γ gamma\gammaγ deduced from the experiment was ( 1 ± 4 ) × 10 5 ( 1 ± 4 ) × 10 5 (1+-4)xx10^(-5)(1 \pm 4) \times 10^{-5}(1±4)×105.
A clock at the center of the centrifuge has β = u = u e x β = u = u e x beta=u=ue_(x)\boldsymbol{\beta}=\boldsymbol{u}=\boldsymbol{u} \boldsymbol{e}_{x}β=u=uex, whereas one on the rim has β = u + v ( e x cos ω t + e y sin ω t ) β = u + v e x cos ω t + e y sin ω t beta=u+v(e_(x)cos omega t+e_(y)sin omega t)\boldsymbol{\beta}=\boldsymbol{u}+v\left(\boldsymbol{e}_{x} \cos \omega t+\boldsymbol{e}_{y} \sin \omega t\right)β=u+v(excosωt+eysinωt). Thus, the shift between rim and disk should vary with position
Δ λ / λ = Δ v / ν = 2 γ u v cos ω t + usual transverse shift. Δ λ / λ = Δ v / ν = 2 γ u v cos ω t +  usual transverse shift.  Delta lambda//lambda=-Delta v//nu=-2gamma uv cos omega t+" usual transverse shift. "\Delta \lambda / \lambda=-\Delta v / \nu=-2 \gamma u v \cos \omega t+\text { usual transverse shift. }Δλ/λ=Δv/ν=2γuvcosωt+ usual transverse shift. 
The data of Turner and Hill, using the Mössbauer effect, show that
(38.18) | γ | < 4 × 10 5 . (38.18) | γ | < 4 × 10 5 . {:(38.18)|gamma| < 4xx10^(-5).:}\begin{equation*} |\gamma|<4 \times 10^{-5} . \tag{38.18} \end{equation*}(38.18)|γ|<4×105.
Hence, a cosmological vector field, if present, has only a weak direct coupling to matter.
For further discussion of these experiments and references on others like them, see Dicke (1964b).

OTHER THEORIES OF GRAVITY AND THE POST-NEWTONIAN APPROXIMATION

Role of alternative gravitation theories as foils for experimental tests

§39.1. OTHER THEORIES

Among all bodies of physical law none has ever been found that is simpler or more beautiful than Einstein's geometric theory of gravity (Chapters 16 and 17); nor has any theory of gravity ever been discovered that is more compelling.
As experiment after experiment has been performed, and one theory of gravity after another has fallen by the wayside a victim of the observations, Einstein's theory has stood firm. No purported inconsistency between experiment and Einstein's laws of gravity has ever surmounted the test of time.
Query: Why then bother to examine alternative theories of gravity? Reply: To have "foils" against which to test Einstein's theory.
To say that Einstein's geometrodynamics is "battle-tested" is to say it has won every time it has been tried against a theory that makes a different prediction. How then does one select new antagonists for decisive new trials by combat?
Not all theories of gravity are created equal. Very few, among the multitude in the literature, are sufficiently viable to be worth comparison with general relativity or with future experiments. The "worthy" theories are those which satisfy three criteria for viability: self-consistency, completeness, and agreement with past experiment.
Self-consistency is best illustrated by describing several theories that fail this test. The classic example of an internally inconsistent theory is the spin-two field theory of gravity [Fierz and Pauli (1939); Box 7.1 here], which is equivalent to linearized general relativity (Chapter 18). The field equations of the spin-two theory imply that all gravitating bodies move along straight lines in global Lorentz reference frames, whereas the equations of motion of the theory insist that gravity deflects
bodies away from straight-line motion. (When one tries to remedy this inconsistency, one finds oneself being "bootstrapped" up to general relativity; see route 5 of Box 17.2.) Another self-inconsistent theory is that of Kustaanheimo (1966). It predicts zero gravitational redshift when the wave version of light (Maxwell theory) is used, and nonzero redshift when the particle version (photon) is used.
Completeness: To be complete a theory of gravity must be capable of analyzing from "first principles" the outcome of every experiment of interest. It must therefore mesh with and incorporate a consistent set of laws for electromagnetism, quantum mechanics, and all other physics. No theory is complete if it postulates that atomic clocks measure the "interval" d τ = ( g α β d x α d x β ) 1 / 2 d τ = g α β d x α d x β 1 / 2 d tau=(-g_(alpha beta)dx^(alpha)dx^(beta))^(1//2)d \tau=\left(-g_{\alpha \beta} d x^{\alpha} d x^{\beta}\right)^{1 / 2}dτ=(gαβdxαdxβ)1/2 constructed from a particular metric. Atomic clocks are complex systems whose behavior must be calculated from the fundamental laws of quantum theory and electromagnetism. No theory is complete if it postulates that planets move on geodesics. Planets are complex systems whose motion must be calculated from fundamental laws for the response of stressed matter to gravity. For further discussion see § § 16.4 , 20.6 § § 16.4 , 20.6 §§16.4,20.6\S \S 16.4,20.6§§16.4,20.6, and 40.9 .
Agreement with past experiment: The necessity that a theory agree, to within several standard deviations, with the "four standard tests" (gravitational redshift, perihelion shift, electromagnetic-wave deflection, and radar time-delay) is obvious. Equally obvious but often forgotten is the need to agree with the expansion of the universe (historically the ace among all aces of general relativity) and with observations at the more everyday, Newtonian level. Example: Birkhoff's (1943) theory predicts the same redshift, perihelion shift, deflection, and time-delay as general relativity. But it requires that the pressure inside gravitating bodies equal the total density of mass-energy, p = ρ p = ρ p=rhop=\rhop=ρ; and, as a consequence, it demands that sound waves travel with the speed of light. Of course, this prediction disagrees violently with experiment. Therefore, Birkhoff's theory is not viable. Another example: Whitehead's (1922) theory of gravity was long considered a viable alternative to Einstein's theory, because it makes exactly the same prediction as Einstein for the "four standard tests." Not until the work of Will (1971b) was it realized that Whitehead's theory predicts a time-dependence for the ebb and flow of ocean tides that is completely contradicted by everyday experience (see § 40.8 § 40.8 §40.8\S 40.8§40.8 ).

§39.2. METRIC THEORIES OF GRAVITY

Two lines of argument narrow attention to a restricted class of gravitation theories, called metric theories.
The first line of argument constitutes the theme of the preceding chapter. It examined experiment after experiment, and reached two conclusions: (1) spacetime possesses a metric; and (2) that metric satisfies the equivalence principle (the standard special relativistic laws of physics are valid in each local Lorentz frame). Theories of gravity that incorporate these two principles are called metric theories.* In brief, Chapter 38 says, "For any adequate description of gravity, look to a metric theory."
Why attention focuses on metric theories of gravity
How metric theories differ
Weak-field, slow-motion expansion of a metric theory
Exception: Cartan's (1922b, 1923) theory ["general relativity plus torsion"; see Trautman (1972)] is nonmetric, but agrees with experiment and is experimentally indistinguishable from general relativity with the technology of the 1970's.
The second line of argument pointing to metric theories begins with the issue of completeness (preceding section). To be complete, a theory must incorporate a self-consistent version of all the nongravitational laws of physics. No one has found a way to incorporate the rest of physics with ease except to introduce a metric, and then invoke the principle of equivalence. Other approaches lead to dismaying complexity, and usually to failure of the theory on one of the three counts of self-consistency, completeness, and agreement with past experiment. All the theories known to be viable in 1973 are metric, except Cartan's. [See Ni (1972b); Will (1972).]
In only one significant way do metric theories of gravity differ from each other: their laws for the generation of the metric. In general relativity theory, the metric is generated directly by the stress-energy of matter and of nongravitational fields. In Dicke-Brans-Jordan theory (Box 39.1, p. 1070), matter and nongravitational fields generate a scalar field ϕ ϕ phi\phiϕ; then ϕ ϕ phi\phiϕ acts together with the matter and other fields to generate the metric. Expressed in the language of § 38.7 , ϕ § 38.7 , ϕ §38.7,phi\S 38.7, \phi§38.7,ϕ is a "new long-range field" that couples indirectly to matter. As another example, a theory devised by Ni ( 1970 , 1972 ) Ni ( 1970 , 1972 ) Ni(1970,1972)\mathrm{Ni}(1970,1972)Ni(1970,1972) (Box 39.1) possesses a flat-space metric η η eta\boldsymbol{\eta}η and a universal time coordinate t t ttt ("prior geometry"; see §17.6); η η eta\boldsymbol{\eta}η acts together with matter and nongravitational fields to generate a scalar field ϕ ϕ phi\phiϕ; and then η , t η , t eta,t\boldsymbol{\eta}, tη,t, and ϕ ϕ phi\phiϕ combine to create the physical metric g g g\boldsymbol{g}g that enters into the equivalence principle.
All three of the above theories-Einstein, Dicke-Brans-Jordan, Ni-were viable in the summer of 1971, when this section was written. But in autumn 1971 Ni's theory, and many other theories that had been regarded as viable, were proved by Nordtvedt and Will (1972) to disagree with experiment. This is an example of the rapidity of current progress in experimental tests of gravitation theory!
Henceforth, in this chapter and the next, attention will be confined to metric theories of gravity and their comparison with experiment.

§39.3. POST-NEWTONIAN LIMIT AND PPN FORMALISM

The solar system, where experiments to distinguish between metric theories are performed, has weak gravity,
(39.1a) | Φ | =∣ Newtonian potential ∣≲ 10 6 ; (39.1a) | Φ | =∣  Newtonian potential  ∣≲ 10 6 ; {:(39.1a)|Phi|=∣" Newtonian potential "∣≲10^(-6);:}\begin{equation*} |\Phi|=\mid \text { Newtonian potential } \mid \lesssim 10^{-6} ; \tag{39.1a} \end{equation*}(39.1a)|Φ|=∣ Newtonian potential ∣≲106;
moreover, the matter that generates solar-system gravity moves slowly
(39.1b) v 2 = ( velocity relative to solar-system center of mass ) 2 10 7 (39.1b) v 2 = (  velocity relative to solar-system center of mass  ) 2 10 7 {:(39.1b)v^(2)=(" velocity relative to solar-system center of mass ")^(2)≲10^(-7):}\begin{equation*} v^{2}=(\text { velocity relative to solar-system center of mass })^{2} \lesssim 10^{-7} \tag{39.1b} \end{equation*}(39.1b)v2=( velocity relative to solar-system center of mass )2107
and has small stress and internal energies
(39.1c) | T j k | / ρ o = ( stress divided by baryon "mass" density ) 10 6 , (39.1d) Π = ( ρ ρ o ) / ρ o = ( internal energy density per unit baryon "mass" density ) 10 6 . (39.1c) T j k / ρ o = (  stress divided by baryon "mass" density  ) 10 6 , (39.1d) Π = ρ ρ o / ρ o = (  internal energy density per   unit baryon "mass" density  ) 10 6 . {:[(39.1c)|T_(jk)|//rho_(o)=(" stress divided by baryon "mass" density ") <= 10^(-6)","],[(39.1d)Pi=(rho-rho_(o))//rho_(o)=((" internal energy density per ")/(" unit baryon "mass" density "))≲10^(-6).]:}\begin{gather*} \left|T_{j k}\right| / \rho_{o}=(\text { stress divided by baryon "mass" density }) \leqq 10^{-6}, \tag{39.1c}\\ \Pi=\left(\rho-\rho_{o}\right) / \rho_{o}=\binom{\text { internal energy density per }}{\text { unit baryon "mass" density }} \lesssim 10^{-6} . \tag{39.1d} \end{gather*}(39.1c)|Tjk|/ρo=( stress divided by baryon "mass" density )106,(39.1d)Π=(ρρo)/ρo=( internal energy density per  unit baryon "mass" density )106.
[Here the baryon "mass" density ρ o ρ o rho_(o)\rho_{o}ρo, despite its name, and despite the fact it is sometimes even more misleadingly called "density of rest mass-energy," is actually a measure of the number density of baryons n n nnn, and nothing more. It is defined as the product of n n nnn with some standard figure for the mass per baryon, μ 0 μ 0 mu_(0)\mu_{0}μ0, in some well-defined standard state; thus,
(39.1e) ρ o n μ 0 ] (39.1e) ρ o n μ 0 {:(39.1e){:rho_(o)-=nmu_(0)*]:}\begin{equation*} \left.\rho_{o} \equiv n \mu_{0} \cdot\right] \tag{39.1e} \end{equation*}(39.1e)ρonμ0]
Consequently, the analysis of solar-system experiments using any metric theory of gravity can be simplified, without significant loss of accuracy, by a simultaneous expansion in the small parameters | Φ | , v 2 , | T j k | / ρ o | Φ | , v 2 , T j k / ρ o |Phi|,v^(2),|T_(jk)|//rho_(o)|\Phi|, v^{2},\left|T_{j k}\right| / \rho_{o}|Φ|,v2,|Tjk|/ρo, and Π Π Pi\PiΠ. Such a "weak-field, slow-motion expansion" gives: (1) flat, empty spacetime in "zero order"; (2) the Newtonian treatment of the solar system in "first order"; and (3) post-Newtonian corrections to the Newtonian treatment in "second order".
The formalism of Newtonian theory plus post-Newtonian corrections is called the "post-Newtonian approximation." Each metric theory has its own post-Newtonian approximation. Despite the great differences between metric theories themselves, their post-Newtonian approximations are very similar. They are so similar, in fact, that one can construct a single post-Newtonian theory of gravity, devoid of any reference to indirectly coupling fields ( ϕ ϕ phi\phiϕ in Dicke-Brans-Jordan; η , t η , t eta,t\boldsymbol{\eta}, tη,t, and ϕ ϕ phi\phiϕ in Ni; see Box 39.1), that contains the post-Newtonian approximation of every conceivable metric theory as a special case. This all-inclusive post-Newtonian theory is called the "Parametrized Post-Newtonian (PPN) Formalism." It contains a set of parameters (called "PPN parameters") that can be specified arbitrarily. One set of values for these parameters makes the PPN formalism identical to the post-Newtonian limit of general relativity; another set of values makes it the post-Newtonian limit of Dicke-Brans-Jordan theory, etc.
Subsequent sections of this chapter present a version of the PPN formalism devised by Clifford M. Will and Kenneth Nordtvedt, Jr. (1972). [See also Will (1972).] This version, containing ten PPN parameters, encompasses as special cases nearly every metric theory of gravity known to the authors. The few exceptions [Whitehead (1922) and theories reviewed by Will (1973)] all disagree with experiment. One can include them in the PPN formalism by adding additional terms and parameters.
The ten parameters are described heuristically in Box 39.2, for the convenience of readers who would skip the full details of the formalism ( $ § 39.4 39.12 $ § 39.4 39.12 $§39.4-39.12\$ \S 39.4-39.12$§39.439.12 ).
How accurate is the PPN formalism? Or, stated more precisely, how accurately does the post-Newtonian approximation agree with the metric theory from which it comes? In the solar system, where | Φ | , v 2 , | T j k | / ρ o | Φ | , v 2 , T j k / ρ o |Phi|,v^(2),|T_(jk)|//rho_(o)|\Phi|, v^{2},\left|T_{j k}\right| / \rho_{o}|Φ|,v2,|Tjk|/ρo, and Π Π Pi\PiΠ are all 10 6 10 6 <= 10^(-6)\leqq 10^{-6}106, the post-Newtonian approximation makes fractional errors of 10 6 10 6 ≲10^(-6)\lesssim 10^{-6}106 in quantities of post-Newtonian order, and fractional errors of 10 12 10 12 ≲10^(-12)\lesssim 10^{-12}1012 in quantities of Newtonian order. For example, it misrepresents the deflection of light by 10 6 × 10 6 × <= 10^(-6)xx\leqq 10^{-6} \times106× (post-Newtonian deflection) 10 6 10 6 ∼10^(-6)\sim 10^{-6}106 seconds of arc. And it ignores relativistic deformations of the Earth's orbit of magnitude < 10 12 × < 10 12 × < 10^(-12)xx<10^{-12} \times<1012× (one astronomical unit) 10 10 ∼10\sim 1010 centimeters. Clearly, there is no need in the 1970's to use higher-order corrections to the postNewtonian approximation; and hence no need to construct a "parametrized post-post-Newtonian framework." However, in the words of Shapiro (1971b): "If one projects from the achievements in the last decade, it is not unreasonable to predict
Post-Newtonian approximation
PPN formalism
Accuracy of PPN formalism in solar system

Box 39.1 THE THEORIES OF DICKE-BRANS-JORDAN AND OF NI

A. Dicke-Brans-Jordan

References: Brans and Dicke (1961); Jordan (1959). [Notes: This is the special case η = 1 η = 1 eta=-1\eta=-1η=1 of Jordan's theory. An alternative mathematical representation of the theory is given by Dicke (1962).]
Fields associated with gravity:
ϕ ϕ phi\phiϕ, a long-range scalar field;
g g g\boldsymbol{g}g, the metric of spacetime (from which are constructed the covariant derivative grad\boldsymbol{\nabla} and the curvature tensors, in the usual manner).
Equations by which these fields are determined:
The trace of the stress-energy tensor generates ϕ ϕ phi\phiϕ via the curved-spacetime wave equation
ϕ = ϕ ; α , α = 8 π 3 + 2 ω T , ϕ = ϕ ; α , α = 8 π 3 + 2 ω T , ◻phi=phi_(;alpha)^(,alpha)=(8pi)/(3+2omega)T,\square \phi=\phi_{; \alpha}^{, \alpha}=\frac{8 \pi}{3+2 \omega} T,ϕ=ϕ;α,α=8π3+2ωT,
where ω ω omega\omegaω is the dimensionless "Dicke coupling constant."
The stress-energy tensor and ϕ ϕ phi\phiϕ together generate the metric (i.e., the spacetime curvature) via the field equations
G α β = 8 π ϕ T α β + ω ϕ 2 ( ϕ , α ϕ , β 1 2 g α β ϕ , μ ϕ , μ ) + 1 ϕ ( ϕ ; α β g α β ϕ ) , G α β = 8 π ϕ T α β + ω ϕ 2 ϕ , α ϕ , β 1 2 g α β ϕ , μ ϕ , μ + 1 ϕ ϕ ; α β g α β ϕ , G_(alpha beta)=(8pi)/(phi)T_(alpha beta)+(omega)/(phi^(2))(phi_(,alpha)phi_(,beta)-(1)/(2)g_(alpha beta)phi_(,mu)phi^(,mu))+(1)/(phi)(phi_(;alpha beta)-g_(alpha beta)◻phi),G_{\alpha \beta}=\frac{8 \pi}{\phi} T_{\alpha \beta}+\frac{\omega}{\phi^{2}}\left(\phi_{, \alpha} \phi_{, \beta}-\frac{1}{2} g_{\alpha \beta} \phi_{, \mu} \phi^{, \mu}\right)+\frac{1}{\phi}\left(\phi_{; \alpha \beta}-g_{\alpha \beta} \square \phi\right),Gαβ=8πϕTαβ+ωϕ2(ϕ,αϕ,β12gαβϕ,μϕ,μ)+1ϕ(ϕ;αβgαβϕ),
where G α β G α β G_(alpha beta)G_{\alpha \beta}Gαβ is the Einstein tensor.
Variational principle for these equations:
δ [ ϕ R ω ( ϕ , α ϕ , α / ϕ ) + 16 π L ] ( g ) 1 / 2 d 4 x = 0 δ ϕ R ω ϕ , α ϕ , α / ϕ + 16 π L ( g ) 1 / 2 d 4 x = 0 delta int[phi R-omega(phi_(,alpha)phi^(,alpha)//phi)+16 pi L](-g)^(1//2)d^(4)x=0\delta \int\left[\phi R-\omega\left(\phi_{, \alpha} \phi^{, \alpha} / \phi\right)+16 \pi L\right](-g)^{1 / 2} d^{4} x=0δ[ϕRω(ϕ,αϕ,α/ϕ)+16πL](g)1/2d4x=0
where R R RRR is the scalar curvature and L L LLL is the matter Lagrangian.
Equivalence principle is satisfied:
The special-relativistic laws of physics are valid, without change, in the local Lorentz frames of the metric g g g\boldsymbol{g}g.
Consequence: the scalar field does not exert any direct influence on matter; its only role is that of participant in the field equations that determine the geometry of spacetime. It is an "indirectly coupling field" in the sense of §38.7.
This theory is self-consistent, complete, and for ω > 5 ω > 5 omega > 5\omega>5ω>5 in "reasonable" accord (two standard deviations or better) with all pre-1973 experiments.

B. Ni

References: Ni ( 1970 , 1972 ) ( 1970 , 1972 ) (1970,1972)(1970,1972)(1970,1972)
Fields associated with gravity:
η η eta\boldsymbol{\eta}η, a flat "background metric" ("prior geometry" in sense of §17.6). There exist,
by assumption, coordinate systems ("background Lorentz frames") in which everywhere at once η 00 = 1 , η 0 j = 0 η 00 = 1 , η 0 j = 0 eta_(00)=-1,eta_(0j)=0\eta_{00}=-1, \eta_{0 j}=0η00=1,η0j=0, and η j k = δ j k η j k = δ j k eta_(jk)=delta_(jk)\eta_{j k}=\delta_{j k}ηjk=δjk.
t t ttt, a scalar field called the "universal time coordinate" ("prior geometry" in sense of §17.6), which is so "tuned" to the background metric that
t 1 α β = 0 , t , α t , β η α β = 1 t 1 α β = 0 , t , α t , β η α β = 1 t_(1alpha beta)=0,quadt_(,alpha)^(t),_(beta)eta^(alpha beta)=-1t_{1 \alpha \beta}=0, \quad t_{, \alpha}{ }^{t},{ }_{\beta} \eta^{\alpha \beta}=-1t1αβ=0,t,αt,βηαβ=1
where "" denotes covariant derivative with respect to η η eta\boldsymbol{\eta}η.
This means there exists a background Lorentz frame (the "rest frame of the universe") in which x 0 = t x 0 = t x^(0)=tx^{0}=tx0=t.
ϕ ϕ phi\phiϕ, a scalar field called the "scalar gravitational field".
g g g\boldsymbol{g}g, the metric of spacetime (from which are constructed the covariant derivative grad\boldsymbol{\nabla} and the curvature tensors, in the usual manner).
Equations by which these fields are determined:
The stress-energy of spacetime generates the scalar gravitational field ϕ ϕ phi\phiϕ via the wave equation
ϕ ϕ , α ; α = 2 π T α β g α β / ϕ = 4 π T α β [ η α β e 2 ϕ + ( e 2 ϕ + e 2 ϕ ) t , α t , β ] . ϕ ϕ , α ; α = 2 π T α β g α β / ϕ = 4 π T α β η α β e 2 ϕ + e 2 ϕ + e 2 ϕ t , α t , β . {:[◻phi-=phi^(,alpha);alpha=-2piT^(alpha beta)delg_(alpha beta)//del phi],[=4piT^(alpha beta)[eta_(alpha beta)e^(-2phi)+(e^(2phi)+e^(-2phi))t_(,alpha)t_(,beta)].]:}\begin{aligned} \square \phi & \equiv \phi^{, \alpha} ; \alpha=-2 \pi T^{\alpha \beta} \partial g_{\alpha \beta} / \partial \phi \\ & =4 \pi T^{\alpha \beta}\left[\eta_{\alpha \beta} e^{-2 \phi}+\left(e^{2 \phi}+e^{-2 \phi}\right) t_{, \alpha} t_{, \beta}\right] . \end{aligned}ϕϕ,α;α=2πTαβgαβ/ϕ=4πTαβ[ηαβe2ϕ+(e2ϕ+e2ϕ)t,αt,β].
ϕ , η ϕ , η phi,eta\phi, \boldsymbol{\eta}ϕ,η, and t t ttt together determine the metric of spacetime through the algebraic relation
g = e 2 ϕ η + ( e 2 ϕ e 2 ϕ ) d t d t . g = e 2 ϕ η + e 2 ϕ e 2 ϕ d t d t . g=e^(-2phi)eta+(e^(-2phi)-e^(2phi))dt ox dt.\boldsymbol{g}=e^{-2 \phi} \boldsymbol{\eta}+\left(e^{-2 \phi}-e^{2 \phi}\right) \boldsymbol{d} t \otimes \boldsymbol{d} t .g=e2ϕη+(e2ϕe2ϕ)dtdt.
Note: In the "rest frame of the universe" that is presupposed in this theory, this metric reduces to
d s 2 = g α β d x α d x β = e 2 ϕ d t 2 + e 2 ϕ ( d x 2 + d y 2 + d z 2 ) d s 2 = g α β d x α d x β = e 2 ϕ d t 2 + e 2 ϕ d x 2 + d y 2 + d z 2 ds^(2)=g_(alpha beta)dx^(alpha)dx^(beta)=-e^(2phi)dt^(2)+e^(-2phi)(dx^(2)+dy^(2)+dz^(2))d s^{2}=g_{\alpha \beta} d x^{\alpha} d x^{\beta}=-e^{2 \phi} d t^{2}+e^{-2 \phi}\left(d x^{2}+d y^{2}+d z^{2}\right)ds2=gαβdxαdxβ=e2ϕdt2+e2ϕ(dx2+dy2+dz2)
Variational principle for the field equation for ϕ ϕ phi\phiϕ :
where L L LLL is the matter Lagrangian.
Equivalence principle is satisfied:
The special-relativistic laws of physics are valid, without change, in the local Lorentz frames of the spacetime metric g g g\boldsymbol{g}g.
Consequence: ϕ , η ϕ , η phi,eta\phi, \boldsymbol{\eta}ϕ,η, and t t ttt do not exert any direct influence on matter; they are "indirectly coupling fields" in the sense of $ 38.7 $ 38.7 $38.7\$ 38.7$38.7.
This theory is self-consistent and complete. If the solar system were at rest in the
"rest frame of the universe", the theory would agree with all experiments to date-except, possibly, the expansion of the universe. But the motion of the solar system through the universe leads to serious disagreement with experiment (Will and Nordtvedt 1972; §40.8).

Box 39.2 HEURISTIC DESCRIPTION OF THE TEN PPN PARAMETERS

Parameter What it measures, relative to general relativity a ^("a "){ }^{\text {a }} Value in General Relativity Value in Dicke-BransJordan Theory b ^("b "){ }^{\text {b }} Value in Ni's Theory b ^("b "){ }^{\text {b }}
γ γ gamma\gammaγ How much space curvature ( g j k ) g j k (g_(jk))\left(g_{j k}\right)(gjk) is produced by unit rest mass? 1 1 + ω 2 + ω 1 + ω 2 + ω (1+omega)/(2+omega)\frac{1+\omega}{2+\omega}1+ω2+ω 1
β β beta\betaβ How much nonlinearity is there in the superposition law for gravity ( g 00 ) g 00 (g_(00))\left(g_{00}\right)(g00) ? 1 1 1
β 1 β 1 beta_(1)\beta_{1}β1 How much gravity ( g 00 g 00 g_(00)g_{00}g00 ) is produced by unit kinetic energy ( 1 2 ρ o v 2 ) 1 2 ρ o v 2 ((1)/(2)rho_(o)v^(2))\left(\frac{1}{2} \rho_{o} v^{2}\right)(12ρov2) ? 1 3 + 2 ω 4 + 2 ω 3 + 2 ω 4 + 2 ω (3+2omega)/(4+2omega)\frac{3+2 \omega}{4+2 \omega}3+2ω4+2ω 1
β 2 β 2 beta_(2)\beta_{2}β2 How much gravity ( g 00 g 00 g_(00)g_{00}g00 ) is produced by unit gravitational potential energy ( ρ o U ) ρ o U (rho_(o)U)\left(\rho_{o} U\right)(ρoU) ? 1 1 + 2 ω 4 + 2 ω 1 + 2 ω 4 + 2 ω (1+2omega)/(4+2omega)\frac{1+2 \omega}{4+2 \omega}1+2ω4+2ω 1
β 3 β 3 beta_(3)\beta_{3}β3 How much gravity ( g 00 g 00 g_(00)g_{00}g00 ) is produced by unit internal energy ( ρ 0 Π ρ 0 Π rho_(0)Pi\rho_{0} \Piρ0Π ) ? 1 1 1
β 4 β 4 beta_(4)\beta_{4}β4 How much gravity ( g 00 g 00 g_(00)g_{00}g00 ) is produced by unit pressure (p)? 1 1 + ω 2 + ω 1 + ω 2 + ω (1+omega)/(2+omega)\frac{1+\omega}{2+\omega}1+ω2+ω 1
ζ ζ zeta\zetaζ How much more gravity ( g 00 g 00 g_(00)g_{00}g00 ) is produced by radial kinetic energy [ 1 2 ρ o ( v r ^ ) 2 ] 1 2 ρ o ( v r ^ ) 2 [(1)/(2)rho_(o)(v*( hat(r)))^(2)]\left[\frac{1}{2} \rho_{o}(v \cdot \hat{r})^{2}\right][12ρo(vr^)2]-i.e., kinetic energy of motion toward obser-ver-than by transverse kinetic energy? 0 0 0
η η eta\etaη How much more gravity ( g 00 ) g 00 (g_(00))\left(g_{00}\right)(g00) is produced by radial stress [ r ^ t r ^ ] [ r ^ t r ^ ] [ hat(r)*t* hat(r)][\hat{r} \cdot \boldsymbol{t} \cdot \hat{r}][r^tr^] than by transverse stress? 0 0 0
Δ 1 Δ 1 Delta_(1)\Delta_{1}Δ1 How much dragging of inertial frames ( g 0 j ) g 0 j (g_(0j))\left(g_{0 j}\right)(g0j) is produced by unit momentum ( ρ o v ) ρ o v (rho_(o)v)\left(\rho_{o} v\right)(ρov) ? 1 10 + 7 ω 14 + 7 ω 10 + 7 ω 14 + 7 ω (10+7omega)/(14+7omega)\frac{10+7 \omega}{14+7 \omega}10+7ω14+7ω 1 7 1 7 -(1)/(7)-\frac{1}{7}17
Δ 2 Δ 2 Delta_(2)\Delta_{2}Δ2 How much easier is it for momentum ( ρ o v ρ o v rho_(o)v\rho_{o} vρov ) to drag inertial frames radially (toward the observer) than in a transverse direction? 1 1 1
Parameter What it measures, relative to general relativity ^("a ") Value in General Relativity Value in Dicke-BransJordan Theory ^("b ") Value in Ni's Theory ^("b ") gamma How much space curvature (g_(jk)) is produced by unit rest mass? 1 (1+omega)/(2+omega) 1 beta How much nonlinearity is there in the superposition law for gravity (g_(00)) ? 1 1 1 beta_(1) How much gravity ( g_(00) ) is produced by unit kinetic energy ((1)/(2)rho_(o)v^(2)) ? 1 (3+2omega)/(4+2omega) 1 beta_(2) How much gravity ( g_(00) ) is produced by unit gravitational potential energy (rho_(o)U) ? 1 (1+2omega)/(4+2omega) 1 beta_(3) How much gravity ( g_(00) ) is produced by unit internal energy ( rho_(0)Pi ) ? 1 1 1 beta_(4) How much gravity ( g_(00) ) is produced by unit pressure (p)? 1 (1+omega)/(2+omega) 1 zeta How much more gravity ( g_(00) ) is produced by radial kinetic energy [(1)/(2)rho_(o)(v*( hat(r)))^(2)]-i.e., kinetic energy of motion toward obser-ver-than by transverse kinetic energy? 0 0 0 eta How much more gravity (g_(00)) is produced by radial stress [ hat(r)*t* hat(r)] than by transverse stress? 0 0 0 Delta_(1) How much dragging of inertial frames (g_(0j)) is produced by unit momentum (rho_(o)v) ? 1 (10+7omega)/(14+7omega) -(1)/(7) Delta_(2) How much easier is it for momentum ( rho_(o)v ) to drag inertial frames radially (toward the observer) than in a transverse direction? 1 1 1| Parameter | What it measures, relative to general relativity ${ }^{\text {a }}$ | Value in General Relativity | Value in Dicke-BransJordan Theory ${ }^{\text {b }}$ | Value in Ni's Theory ${ }^{\text {b }}$ | | :---: | :---: | :---: | :---: | :---: | | $\gamma$ | How much space curvature $\left(g_{j k}\right)$ is produced by unit rest mass? | 1 | $\frac{1+\omega}{2+\omega}$ | 1 | | $\beta$ | How much nonlinearity is there in the superposition law for gravity $\left(g_{00}\right)$ ? | 1 | 1 | 1 | | $\beta_{1}$ | How much gravity ( $g_{00}$ ) is produced by unit kinetic energy $\left(\frac{1}{2} \rho_{o} v^{2}\right)$ ? | 1 | $\frac{3+2 \omega}{4+2 \omega}$ | 1 | | $\beta_{2}$ | How much gravity ( $g_{00}$ ) is produced by unit gravitational potential energy $\left(\rho_{o} U\right)$ ? | 1 | $\frac{1+2 \omega}{4+2 \omega}$ | 1 | | $\beta_{3}$ | How much gravity ( $g_{00}$ ) is produced by unit internal energy ( $\rho_{0} \Pi$ ) ? | 1 | 1 | 1 | | $\beta_{4}$ | How much gravity ( $g_{00}$ ) is produced by unit pressure (p)? | 1 | $\frac{1+\omega}{2+\omega}$ | 1 | | $\zeta$ | How much more gravity ( $g_{00}$ ) is produced by radial kinetic energy $\left[\frac{1}{2} \rho_{o}(v \cdot \hat{r})^{2}\right]$-i.e., kinetic energy of motion toward obser-ver-than by transverse kinetic energy? | 0 | 0 | 0 | | $\eta$ | How much more gravity $\left(g_{00}\right)$ is produced by radial stress $[\hat{r} \cdot \boldsymbol{t} \cdot \hat{r}]$ than by transverse stress? | 0 | 0 | 0 | | $\Delta_{1}$ | How much dragging of inertial frames $\left(g_{0 j}\right)$ is produced by unit momentum $\left(\rho_{o} v\right)$ ? | 1 | $\frac{10+7 \omega}{14+7 \omega}$ | $-\frac{1}{7}$ | | $\Delta_{2}$ | How much easier is it for momentum ( $\rho_{o} v$ ) to drag inertial frames radially (toward the observer) than in a transverse direction? | 1 | 1 | 1 |
that in the 1980's techniques will be available to detect second-order effects of general relativity. At that point the ratio of theoretical to experimental relativists may take a sharp turn downwards."
Actually, there are a few exceptions to the claim that the post-Newtonian approximation suffices for the 1970's. These exceptions occur where the external universe impinges on and influences the solar system. For example, gravitational waves propagating into the solar system from distant sources (Chapters 35-37) are ignored by every post-Newtonian approximation and by the PPN framework. They must be treated using a full metric theory or a weak-field, "fast-motion" approximation
to such a theory. Similarly, time-dependence of the "gravitational constant" ( $ 40.8 $ 40.8 $40.8\$ 40.8$40.8 ), induced in some theories by expansion of the universe, is beyond the scope of the PPN formalism, as is the expansion itself.
The PPN formalism is used not only in interpreting experimental tests of gravitation theories, but also as a powerful tool in theoretical astrophysics. By specializing all the PPN parameters to unity, except ζ = η = 0 ζ = η = 0 zeta=eta=0\zeta=\eta=0ζ=η=0, one obtains the post-Newtonian approximation to Einstein's theory of gravity. This post-Newtonian approximation can then be used (and has been used extensively) to calculate general relativistic corrections to such phenomena as the structure and stability of stars.*

Historical and Notational Notes

The earliest parametrizations of the post-Newtonian approximation were performed, and used in interpreting solar system experiments, by Eddington (1922), Robertson (1962), and Schiff ( 1962 , 1967 ) ( 1962 , 1967 ) (1962,1967)(1962,1967)(1962,1967). However, they dealt solely with the vacuum gravitational field outside an isolated, spherical body (the sun). Nordtvedt (1968b, 1969) devised the first full PPN formalism, capable of treating all aspects of the solar system; he treated the sun, planets, and moon as made from "gases" of point-particles (atoms) that interact gravitationally and electromagnetically. Will (1971c) later used techniques devised by Chandrasekhar (1965a) to modify Nordtvedt's formalism, so that it employs a stressed, continuous-matter description of celestial bodies. The version of the formalism presented here, devised by Will and Nordtvedt (1972), generalizes all previous versions to acquire "post-Galilean invariance" [see Chandrasekhar and Contopolous (1967)]. The most detailed and up-to-date review article on the PPN formalism is Will (1972).
In the literature of post-Newtonian physics and the PPN formalism, the Newtonian potential is described traditionally not by Φ Φ Phi\PhiΦ, but by
(39.2) U Φ + ρ o ( x ) d 3 x | x x | (39.2) U Φ + ρ o x d 3 x x x {:(39.2)U-=-Phi-=+int(rho_(o)(x^('))d^(3)x^('))/(|x-x^(')|):}\begin{equation*} U \equiv-\Phi \equiv+\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}\right) d^{3} x^{\prime}}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} \tag{39.2} \end{equation*}(39.2)UΦ+ρo(x)d3x|xx|
To avoid confusion, this chapter and the next will use U U UUU, although the rest of the book uses Φ Φ Phi\boldsymbol{\Phi}Φ.
Turn now to a detailed, Track-2 exposition of the PPN formalism.

§39.4. PPN COORDINATE SYSTEM

The PPN formalism covers the solar system (or whatever system is being analyzed) with coordinates ( t , x j ) ( t , x j ) t , x j t , x j (t,x_(j))-=(t,x^(j))\left(t, x_{j}\right) \equiv\left(t, x^{j}\right)(t,xj)(t,xj) that are as nearly globally Lorentz as possible:
(39.3) g α β = η α β + h α β , | h α β | M / R 10 6 . (39.3) g α β = η α β + h α β , h α β M / R 10 6 . {:(39.3)g_(alpha beta)=eta_(alpha beta)+h_(alpha beta)","quad|h_(alpha beta)| <= M_(o.)//R_(o.)∼10^(-6).:}\begin{equation*} g_{\alpha \beta}=\eta_{\alpha \beta}+h_{\alpha \beta}, \quad\left|h_{\alpha \beta}\right| \leqq M_{\odot} / R_{\odot} \sim 10^{-6} . \tag{39.3} \end{equation*}(39.3)gαβ=ηαβ+hαβ,|hαβ|M/R106.
Applications of PPN formalism to astrophysics
History and notation of PPN formalism
EXPOSITION OF PPN
FORMALISM:
Coordinate system
The rest of this chapter is Track 2. No earlier Track-2 material is needed as preparation for it, but the following will be helpful:
(1) Chapter 7 (incompatibility of gravity and special relativity)
(2) § 17.6 (no prior geometry);
(3) §§36.9-36.11 (generation of gravitational waves); and
(4) Chapter 38 (tests of foundations).
This chapter is not needed as preparation for any later chapter, but it will be helpful in Chapter 40 (solar-system tests)
(In this sense the PPN formalism is like linearized theory; see Chapter 18.) The velocity of the coordinate system (i.e., 4 -velocity of its spatial origin) is so chosen that the solar system is approximately at rest in these coordinates. (Whether the center of mass of the solar system is precisely at rest, or is moving with some low velocity v ( M / R ) 1 / 2 10 3 300 km / sec v M / R 1 / 2 10 3 300 km / sec v <= (M_(o.)//R_(o.))^(1//2)∼10^(-3)∼300km//secv \leqq\left(M_{\odot} / R_{\odot}\right)^{1 / 2} \sim 10^{-3} \sim 300 \mathrm{~km} / \mathrm{sec}v(M/R)1/2103300 km/sec, is a matter for the user of the formalism to decide. For more on the options, see § § 39.9 § § 39.9 §§39.9\S \S 39.9§§39.9 and 39.12.)
The PPN coordinates provide one with a natural " 3 + 1 3 + 1 3+13+13+1 " split of spacetime into space plus time. That split is conveniently treated using the notation of three-dimensional, flat-space vector analysis-even though spacetime and the three-dimensional hypersurfaces x 0 = x 0 = x^(0)=x^{0}=x0= constant are both curved. The resultant three-dimensional formalism will look more like Newtonian theory than like general relativity-as, indeed, one wishes it to; after all, one's goal is to study small relativistic corrections to Newtonian theory!

§39.5. DESCRIPTION OF THE MATTER IN THE SOLAR SYSTEM

Description of matter
Relative to the PPN coordinates, the matter of the solar system (idealized as a stressed medium) has a coordinate-velocity field
(39.4) v j d x j / d x 0 (39.4) v j d x j / d x 0 {:(39.4)v_(j)-=dx_(j)//dx^(0):}\begin{equation*} v_{j} \equiv d x_{j} / d x^{0} \tag{39.4} \end{equation*}(39.4)vjdxj/dx0
Choose an event P P P\mathscr{P}P, and in its neighborhood transform to an orthonormal frame that moves with the matter there. Orient the spatial axes e j e j e_(j)\boldsymbol{e}_{j}ej of this comoving frame so that they coincide as accurately as possible with the PPN coordinate axes. (This requirement will be made more precise in §39.10.) In the orthonormal comoving frame, define the following quantities, which describe the state of the matter:
(39.5a) ( density of total mass-energy ) ρ ; (baryon "mass" density) ρ o (39.5b) ( number density of baryons, n ) × ( standard rest mass per baryon, μ 0 for matter in some standard state ) ; (39.5c) (specific internal energy density ) Π ( ρ ρ o ) / ρ o ; (39.5d) (components of stress tensor) t i ^ j ^ e i ^ T e j ^ ; (39.5e) (pressure) p 1 3 ( t x ^ x ^ + t y ^ y ^ + t z ^ z ^ ) (average of stress over all directions). (39.5a) (  density of total mass-energy  ) ρ ;  (baryon "mass" density)  ρ o (39.5b) (  number density   of baryons,  n ) × (  standard rest mass per baryon,  μ 0  for matter in some standard state  ) ; (39.5c)  (specific internal energy density  ) Π ρ ρ o / ρ o ; (39.5d)  (components of stress tensor)  t i ^ j ^ e i ^ T e j ^ ; (39.5e)  (pressure)  p 1 3 t x ^ x ^ + t y ^ y ^ + t z ^ z ^  (average of stress over all directions).  {:[(39.5a)(" density of total mass-energy ")-=rho;],[" (baryon "mass" density) "-=rho_(o)],[(39.5b)-=((" number density ")/(" of baryons, "n))xx((" standard rest mass per baryon, "mu_(0))/(" for matter in some standard state "));],[(39.5c)" (specific internal energy density ")-=Pi-=(rho-rho_(o))//rho_(o);],[(39.5d)" (components of stress tensor) "-=t_( hat(i) hat(j))-=e_( hat(i))*T*e_( hat(j));],[(39.5e){:[" (pressure) "-=p-=(1)/(3)(t_( hat(x) hat(x))+t_( hat(y) hat(y))+t_( hat(z) hat(z)))],[quad-=" (average of stress over all directions). "]:}]:}\begin{align*} & (\text { density of total mass-energy }) \equiv \rho ; \tag{39.5a}\\ & \text { (baryon "mass" density) } \equiv \rho_{o} \\ & \equiv\binom{\text { number density }}{\text { of baryons, } n} \times\binom{\text { standard rest mass per baryon, } \mu_{0}}{\text { for matter in some standard state }} ; \tag{39.5b}\\ & \text { (specific internal energy density }) \equiv \Pi \equiv\left(\rho-\rho_{o}\right) / \rho_{o} ; \tag{39.5c}\\ & \text { (components of stress tensor) } \equiv t_{\hat{i} \hat{j}} \equiv \boldsymbol{e}_{\hat{i}} \cdot \boldsymbol{T} \cdot \boldsymbol{e}_{\hat{j}} ; \tag{39.5d}\\ & \begin{aligned} \text { (pressure) } \equiv p \equiv \frac{1}{3}\left(t_{\hat{x} \hat{x}}+t_{\hat{y} \hat{y}}+t_{\hat{z} \hat{z}}\right) \\ \quad \equiv \text { (average of stress over all directions). } \end{aligned} \tag{39.5e} \end{align*}(39.5a)( density of total mass-energy )ρ; (baryon "mass" density) ρo(39.5b)( number density  of baryons, n)×( standard rest mass per baryon, μ0 for matter in some standard state );(39.5c) (specific internal energy density )Π(ρρo)/ρo;(39.5d) (components of stress tensor) ti^j^ei^Tej^;(39.5e) (pressure) p13(tx^x^+ty^y^+tz^z^) (average of stress over all directions). 
Anisotropies (i.e., shears) in the stress are important only in planets such as the Earth; and even there they are dominated by the isotropic pressure:
(39.6) t i j = p δ i j + p × [ corrections 1 ] . (39.6) t i j = p δ i j + p × [  corrections  1 ] . {:(39.6)t_(ij)=pdelta_(ij)+p xx[" corrections "≪1].:}\begin{equation*} t_{i j}=p \delta_{i j}+p \times[\text { corrections } \ll 1] . \tag{39.6} \end{equation*}(39.6)tij=pδij+p×[ corrections 1].
For many purposes, especially inside the sun, one can ignore the anisotropies, thereby approximating the solar-system matter as a perfect fluid.*
The isotropic part of the radiation field gives a significant contribution to the pressure, p p ppp, and the density of internal energy, ρ o Π ρ o Π rho_(o)Pi\rho_{o} \PiρoΠ, inside the sun. However, the anisotropic radiation flux is ignored in the stress-energy tensor. This approximation is allowable because in the sun the outward energy flux carried by radiation is less than 10 15 10 15 10^(-15)10^{-15}1015 of the internal energy density ρ o Π ρ o Π rho_(o)Pi\rho_{o} \PiρoΠ; in planets it is even less.

§39.6. NATURE OF THE POST-NEWTONIAN EXPANSION

For any gravitationally bound configuration such as the solar system, the Newtonian approximation imposes limits on the sizes of various dimensionless physical quantities (see exercise 39.1):
(39.7) ϵ 2 maximum value of Newtonian potential U values anywhere of U , v 2 , p / ρ o , | t i j | / ρ o , Π . (39.7) ϵ 2  maximum value of Newtonian potential  U  values anywhere of  U , v 2 , p / ρ o , t i j / ρ o , Π . {:[(39.7)epsilon^(2)-=" maximum value of Newtonian potential "U],[≳" values anywhere of "U","v^(2)","p//rho_(o)","|t_(ij)|//rho_(o)","Pi.]:}\begin{align*} \epsilon^{2} \equiv & \text { maximum value of Newtonian potential } U \tag{39.7}\\ & \gtrsim \text { values anywhere of } U, v^{2}, p / \rho_{o},\left|t_{i j}\right| / \rho_{o}, \Pi . \end{align*}(39.7)ϵ2 maximum value of Newtonian potential U values anywhere of U,v2,p/ρo,|tij|/ρo,Π.
Relative magnitudes of expansion parameters
(The Newtonian potential at the center of the sun is ϵ 2 10 5 ϵ 2 10 5 epsilon^(2)∼10^(-5)\epsilon^{2} \sim 10^{-5}ϵ2105. The values of p / ρ o p / ρ o p//rho_(o)p / \rho_{o}p/ρo, t i j / ρ o t i j / ρ o t_(ij)//rho_(o)t_{i j} / \rho_{o}tij/ρo, and Π Π Pi\PiΠ there are also 10 5 10 5 ∼10^(-5)\sim 10^{-5}105, and they are much smaller elsewhere. The orbital velocities of the planets are all less than 100 km / sec = 3 × 10 4 100 km / sec = 3 × 10 4 100km//sec=3xx10^(-4)100 \mathrm{~km} / \mathrm{sec}=3 \times 10^{-4}100 km/sec=3×104, so v 2 < 10 7 v 2 < 10 7 v^(2) < 10^(-7)v^{2}<10^{-7}v2<107.) Moreover, changes with time of all quantities at fixed x j x j x_(j)x_{j}xj are due primarily to the motion of the matter. As a result, time derivatives are small by O ( ϵ ) O ( ϵ ) O(epsilon)O(\epsilon)O(ϵ) compared to space derivatives,
(39.8) | A / t A / x j | | v j | ϵ for any quantity A , (39.8) A / t A / x j v j ϵ  for any quantity  A , {:(39.8)|(del A//del t)/(del A//delx_(j))|∼|v_(j)| <= epsilon" for any quantity "A",":}\begin{equation*} \left|\frac{\partial A / \partial t}{\partial A / \partial x_{j}}\right| \sim\left|v_{j}\right| \leq \epsilon \text { for any quantity } A, \tag{39.8} \end{equation*}(39.8)|A/tA/xj||vj|ϵ for any quantity A,
although not in the radiation zone, where outgoing gravitational waves flow (distance >=\geq one light year from Sun). Consequently, the radiation zone must be excluded from the analysis when one makes a post-Newtonian expansion. To treat it requires different techniques, e.g., those of Chapter 36.
Conditions 39.7 and 39.8 suggest that one expand the metric coefficients in powers of the small parameter ϵ ϵ epsilon\epsilonϵ, treating U , v 2 , p / ρ o , t i j / ρ o U , v 2 , p / ρ o , t i j / ρ o U,v^(2),p//rho_(o),t_(ij)//rho_(o)U, v^{2}, p / \rho_{o}, t_{i j} / \rho_{o}U,v2,p/ρo,tij/ρo, and Π Π Pi\PiΠ as though they were all of O ( ϵ 2 ) O ϵ 2 O(epsilon^(2))O\left(\epsilon^{2}\right)O(ϵ2) (often they are smaller!), and treating time derivatives as O ( ϵ ) O ( ϵ ) O(epsilon)O(\epsilon)O(ϵ) smaller than space derivatives.
In this "post-Newtonian" expansion, terms odd in ϵ ϵ epsilon\epsilonϵ (i.e., terms such as
(39.9) ρ o ( x , t ) v j ( x , t ) | x x | d 3 x M R v ϵ 3 (39.9) ρ o x , t v j x , t x x d 3 x M R v ϵ 3 {:(39.9)int(rho_(o)(x^('),t)v_(j)(x^('),t))/(|x-x^(')|)d^(3)x^(')∼(M)/(R)v∼epsilon^(3):}\begin{equation*} \int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right) v_{j}\left(\boldsymbol{x}^{\prime}, t\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime} \sim \frac{M}{R} v \sim \epsilon^{3} \tag{39.9} \end{equation*}(39.9)ρo(x,t)vj(x,t)|xx|d3xMRvϵ3
whose total number of v v vvv 's and ( / t ) ( / t ) (del//del t)(\partial / \partial t)(/t) 's is odd) change sign under time reversal,
whereas terms even in ϵ ϵ epsilon\epsilonϵ do not. Time reversal ( x 0 = x 0 ) x 0 ¯ = x 0 (x^( bar(0))=-x^(0))\left(x^{\overline{0}}=-x^{0}\right)(x0=x0) also changes the sign of g 0 j ( g 0 j = g 0 j ) g 0 j g 0 ¯ j = g 0 j g_(0j)(g_( bar(0)j)=-g_(0j))g_{0 j}\left(g_{\overline{0} j}=-g_{0 j}\right)g0j(g0j=g0j), but leaves g 00 g 00 g_(00)g_{00}g00 and g j k g j k g_(jk)g_{j k}gjk unchanged. Therefore, g 0 j g 0 j g_(0j)g_{0 j}g0j must contain only terms odd in ϵ ϵ epsilon\epsilonϵ; whereas g 00 g 00 g_(00)g_{00}g00 and g j k g j k g_(jk)g_{j k}gjk must contain only even terms. (Actually, this ceases to be the case when radiation damping enters the picture. In the real world one always insists on outgoing-wave boundary conditions. But time reversal converts outgoing waves to ingoing waves; so an extra sign change is required to convert back to out. Therefore, radiation damping terms in the near-zone metric are even in ϵ ϵ epsilon\epsilonϵ for g 0 j g 0 j g_(0j)g_{0 j}g0j, but odd for g 00 g 00 g_(00)g_{00}g00 and g j k g j k g_(jk)g_{j k}gjk. However, radiation damping does not come into play until order ϵ 5 ϵ 5 epsilon^(5)\epsilon^{5}ϵ5 beyond Newtonian theory-see Chapter 36-so it will be ignored here.)
Expanded form of metric
The form of the expansion is already known through Newtonian order (see §17.4, with Φ Φ Phi\PhiΦ replaced by U U -U-UU ): Newtonian gravity is only obtained when one demanc. that
$$
g 00 = 1 + 2 U + [ terms ϵ 4 ] , (39.10) g 0 j = [ terms ϵ 3 ] , g i j = δ i j + [ terms ϵ 2 ] . g 00 = 1 + 2 U +  terms  ϵ 4 , (39.10) g 0 j =  terms  ϵ 3 , g i j = δ i j +  terms  ϵ 2 . {:[g_(00)=-1+2U+[" terms "≲epsilon^(4)]","],[(39.10)g_(0j)=[" terms "≲epsilon^(3)]","],[g_(ij)=delta_(ij)+[" terms " <= epsilon^(2)].]:}\begin{align*} g_{00} & =-1+2 U+\left[\text { terms } \lesssim \epsilon^{4}\right], \\ g_{0 j} & =\left[\text { terms } \lesssim \epsilon^{3}\right], \tag{39.10}\\ g_{i j} & =\delta_{i j}+\left[\text { terms } \leq \epsilon^{2}\right] . \end{align*}g00=1+2U+[ terms ϵ4],(39.10)g0j=[ terms ϵ3],gij=δij+[ terms ϵ2].
$$
The stated limits on the higher-order corrections are dictated by demanding that the space components of the geodesic equation agree with the Newtonian equation of motion:
(39.11) d 2 x j d t 2 d 2 x j d τ 2 = Γ j α β d x α d τ d x β d τ Γ j α β d x α d t d x β d t = Γ j 00 2 Γ j o k v k Γ j k ! v k v = U , j + terms of order { ϵ g 0 k , j ; ϵ 2 g k , j } . (39.11) d 2 x j d t 2 d 2 x j d τ 2 = Γ j α β d x α d τ d x β d τ Γ j α β d x α d t d x β d t = Γ j 00 2 Γ j o k v k Γ j k ! v k v = U , j +  terms of order  ϵ g 0 k , j ; ϵ 2 g k , j . {:[(39.11)(d^(2)x_(j))/(dt^(2))~~(d^(2)x_(j))/(dtau^(2))=-Gamma^(j)_(alpha beta)(dx^(alpha))/(d tau)(dx^(beta))/(d tau)~~-Gamma^(j)_(alpha beta)(dx^(alpha))/(dt)(dx^(beta))/(dt)],[=-Gamma^(j)_(00)-2Gamma^(j)_(ok)v_(k)-Gamma^(j)_(k!)v_(k)v_(ℓ)],[=U_(,j)+" terms of order "{epsilong_(0k,j);epsilon^(2)g_(kℓ,j)}.]:}\begin{align*} \frac{d^{2} x_{j}}{d t^{2}} \approx \frac{d^{2} x_{j}}{d \tau^{2}} & =-\Gamma^{j}{ }_{\alpha \beta} \frac{d x^{\alpha}}{d \tau} \frac{d x^{\beta}}{d \tau} \approx-\Gamma^{j}{ }_{\alpha \beta} \frac{d x^{\alpha}}{d t} \frac{d x^{\beta}}{d t} \tag{39.11}\\ & =-\Gamma^{j}{ }_{00}-2 \Gamma^{j}{ }_{o k} v_{k}-\Gamma^{j}{ }_{k!} v_{k} v_{\ell} \\ & =U_{, j}+\text { terms of order }\left\{\epsilon g_{0 k, j} ; \epsilon^{2} g_{k \ell, j}\right\} . \end{align*}(39.11)d2xjdt2d2xjdτ2=ΓjαβdxαdτdxβdτΓjαβdxαdtdxβdt=Γj002ΓjokvkΓjk!vkv=U,j+ terms of order {ϵg0k,j;ϵ2gk,j}.
One would get the wrong Newtonian limit if g 0 k g 0 k g_(0k)g_{0 k}g0k were O ( ϵ ) O ( ϵ ) O(epsilon)O(\epsilon)O(ϵ) or greater, and if g k δ k ι g k δ k ι g_(kℓ)-delta_(k iota)g_{k \ell}-\delta_{k \iota}gkδkι were O ( 1 ) O ( 1 ) O(1)O(1)O(1) or greater.
The above pattern continues on to all orders in the expansion. Thus in the geodesic equation, and also in the law of local conservation of energy-momentum T α β ; β = 0 T α β ; β = 0 T^(alpha beta)_(;beta)=0T^{\alpha \beta}{ }_{; \beta}=0Tαβ;β=0, g 00 g 00 g_(00)g_{00}g00 always goes hand-in-hand with ϵ g 0 k ϵ g 0 k epsilong_(0k)\epsilon g_{0 k}ϵg0k and ϵ 2 g j k ϵ 2 g j k epsilon^(2)g_(jk)\epsilon^{2} g_{j k}ϵ2gjk (see exercise 39.2). Therefore, the post-Newtonian expansion has the form summarized in Box 39.3.

EXERCISES

Exercise 39.1. ORDERS OF MAGNITUDE IN GRAVITATIONALLY BOUND SYSTEMS

Use Newtonian theory to derive conditions (39.7) for any gravitationally bound system. [Hint: Such concepts as orbital velocities, the speeds of sound and shear waves, the virial theorem, and hydrostatic equilibrium are relevant.]

Exercise 39.2. PATTERN OF TERMS IN POST-NEWTONIAN EXPANSION

Verify the statements in the paragraph following equation (39.11). In particular, suppose that one wishes to evaluate the coordinate acceleration, d 2 x j / d t 2 d 2 x j / d t 2 d^(2)x_(j)//dt^(2)d^{2} x_{j} / d t^{2}d2xj/dt2, to accuracy ϵ 2 N U , j ϵ 2 N U , j epsilon^(2N)U_(,j)\epsilon^{2 N} U_{, j}ϵ2NU,j for some
integer N N NNN. Show that this undertaking requires a knowledge of g 00 g 00 g_(00)g_{00}g00 to accuracy ϵ 2 N + 2 ϵ 2 N + 2 epsilon^(2N+2)\epsilon^{2 N+2}ϵ2N+2, of g 0 k g 0 k g_(0k)g_{0 k}g0k to ϵ 2 N + 1 ϵ 2 N + 1 epsilon^(2N+1)\epsilon^{2 N+1}ϵ2N+1, and of g i k g i k g_(ik)g_{i k}gik to ϵ 2 N ϵ 2 N epsilon^(2N)\epsilon^{2 N}ϵ2N. Also suppose that one knows T 00 T 00 T^(00)T^{00}T00 to accuracy ρ o ϵ 2 N , T 0 j ρ o ϵ 2 N , T 0 j rho_(o)epsilon^(2N),T^(0j)\rho_{o} \epsilon^{2 N}, T^{0 j}ρoϵ2N,T0j to ρ o ϵ 2 N + 1 ρ o ϵ 2 N + 1 rho_(o)epsilon^(2N+1)\rho_{o} \epsilon^{2 N+1}ρoϵ2N+1, and T j k T j k T^(jk)T^{j k}Tjk to ρ o ϵ 2 N + 2 ρ o ϵ 2 N + 2 rho_(o)epsilon^(2N+2)\rho_{o} \epsilon^{2 N+2}ρoϵ2N+2 [see, e.g., equations (39.13) for N = 0 N = 0 N=0N=0N=0 and (39.42) for N = 2 N = 2 N=2N=2N=2 ]. Show that to calculate T 0 α ; α T 0 α ; α T^(0alpha)_(;alpha)T^{0 \alpha}{ }_{; \alpha}T0α;α with accuracy ϵ 2 N + 1 ρ o , j ϵ 2 N + 1 ρ o , j epsilon^(2N+1)rho_(o,j)\epsilon^{2 N+1} \rho_{o, j}ϵ2N+1ρo,j and T j α ; α T j α ; α T^(j alpha)_(;alpha)T^{j \alpha}{ }_{; \alpha}Tjα;α with accuracy ϵ 2 N + 2 ρ o , j ϵ 2 N + 2 ρ o , j epsilon^(2N+2)rho_(o,j)\epsilon^{2 N+2} \rho_{o, j}ϵ2N+2ρo,j, one must know g 00 g 00 g_(00)g_{00}g00 to ϵ 2 N + 2 , g 0 k ϵ 2 N + 2 , g 0 k epsilon^(2N+2),g_(0k)\epsilon^{2 N+2}, g_{0 k}ϵ2N+2,g0k to ϵ 2 N + 1 ϵ 2 N + 1 epsilon^(2N+1)\epsilon^{2 N+1}ϵ2N+1, and g j k g j k g_(jk)g_{j k}gjk to ϵ 2 N ϵ 2 N epsilon^(2N)\epsilon^{2 N}ϵ2N. This dictates the pattern of Box 39.3.

§39.7. NEWTONIAN APPROXIMATION

At Newtonian order the metric has the form (39.10); and the 4-velocity and stress- Newtonian approximation energy tensor have components, relative to the PPN coordinate system,
(39.12) u 0 = + 1 + O ( ϵ 2 ) , u j = v j + O ( ϵ 3 ) T 00 = ρ o + O ( ρ o ϵ 2 ) , T 0 j = ρ o v j + O ( ρ o ϵ 3 ) , (39.13) T j k = t j j ^ k + ρ o v j v k + O ( ρ o ϵ 4 ) (39.12) u 0 = + 1 + O ϵ 2 , u j = v j + O ϵ 3 T 00 = ρ o + O ρ o ϵ 2 , T 0 j = ρ o v j + O ρ o ϵ 3 , (39.13) T j k = t j j ^ k + ρ o v j v k + O ρ o ϵ 4 {:[(39.12)u^(0)=+1+O(epsilon^(2))","quadu^(j)=v_(j)+O(epsilon^(3))],[T^(00)=rho_(o)+O(rho_(o)epsilon^(2))","quadT^(0j)=rho_(o)v_(j)+O(rho_(o)epsilon^(3))","],[(39.13)T^(jk)=t_(j hat(j)k)+rho_(o)v_(j)v_(k)+O(rho_(o)epsilon^(4))]:}\begin{align*} u^{0} & =+1+O\left(\epsilon^{2}\right), \quad u^{j}=v_{j}+O\left(\epsilon^{3}\right) \tag{39.12}\\ T^{00} & =\rho_{o}+O\left(\rho_{o} \epsilon^{2}\right), \quad T^{0 j}=\rho_{o} v_{j}+O\left(\rho_{o} \epsilon^{3}\right), \\ T^{j k} & =t_{j \hat{j} k}+\rho_{o} v_{j} v_{k}+O\left(\rho_{o} \epsilon^{4}\right) \tag{39.13} \end{align*}(39.12)u0=+1+O(ϵ2),uj=vj+O(ϵ3)T00=ρo+O(ρoϵ2),T0j=ρovj+O(ρoϵ3),(39.13)Tjk=tjj^k+ρovjvk+O(ρoϵ4)
(see exercise 39.3). Two sets of equations govern the structure and evolution of the solar system. (1) The Einstein field equations. As was shown in §18.4, and also in § 17.4 § 17.4 §17.4\S 17.4§17.4, in the Newtonian limit Einstein's equations reduce to Laplace's equation
(39.14a) U , j j = 4 π ρ o (39.14a) U , j j = 4 π ρ o {:(39.14a)U_(,jj)=-4pirho_(o):}\begin{equation*} U_{, j j}=-4 \pi \rho_{o} \tag{39.14a} \end{equation*}(39.14a)U,jj=4πρo
which has the "action-at-a-distance" solution
(39.14b) U ( x , t ) = ρ o ( x , t ) | x x | d 3 x (39.14b) U ( x , t ) = ρ o x , t x x d 3 x {:(39.14b)U(x","t)=int(rho_(o)(x^('),t))/(|x-x^(')|)d^(3)x^('):}\begin{equation*} U(x, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime} \tag{39.14b} \end{equation*}(39.14b)U(x,t)=ρo(x,t)|xx|d3x

Box 39.3 POST-NEWTONIAN EXPANSION OF THE METRIC COEFFICIENTS

Level of approximation
(and papers expanding general
relativity to this level)
Level of approximation (and papers expanding general relativity to this level)| Level of approximation | | :--- | | (and papers expanding general | | relativity to this level) |
Order or value of terms
Order or value of terms| Order or value of terms | | :---: |
flat, empty spacetime -1 g 00 g 00 g_(00)g_{00}g00 0
Newtonian approximation 2 U 2 U 2U2 U2U 0 g j k g j k g_(jk)g_{j k}gjk
post-Newtonian approximation
[(Fock (1959); Chandrasekhar (1965a)]
post-Newtonian approximation [(Fock (1959); Chandrasekhar (1965a)]| post-Newtonian approximation | | :--- | | [(Fock (1959); Chandrasekhar (1965a)] |
+ terms ϵ 4 ϵ 4 ∼epsilon^(4)\sim \epsilon^{4}ϵ4 + terms ϵ 3 ϵ 3 ∼epsilon^(3)\sim \epsilon^{3}ϵ3 + terms ϵ 2 ϵ 2 ∼epsilon^(2)\sim \epsilon^{2}ϵ2
post-post-Newtonian approximation
[Chandrasekhar and Nutku (1969)]
post-post-Newtonian approximation [Chandrasekhar and Nutku (1969)]| post-post-Newtonian approximation | | :--- | | [Chandrasekhar and Nutku (1969)] |
+ terms ϵ 6 ϵ 6 ∼epsilon^(6)\sim \epsilon^{6}ϵ6 + terms ϵ 5 ϵ 5 ∼epsilon^(5)\sim \epsilon^{5}ϵ5 + terms ϵ 4 ϵ 4 ∼epsilon^(4)\sim \epsilon^{4}ϵ4
radiation damping
[Chandrasekhar and Esposito (1970)]
radiation damping [Chandrasekhar and Esposito (1970)]| radiation damping | | :--- | | [Chandrasekhar and Esposito (1970)] |
+ terms ϵ 7 ϵ 7 ∼epsilon^(7)\sim \epsilon^{7}ϵ7 + terms ϵ 6 ϵ 6 ∼epsilon^(6)\sim \epsilon^{6}ϵ6 + terms ϵ 5 ϵ 5 ∼epsilon^(5)\sim \epsilon^{5}ϵ5
"Level of approximation (and papers expanding general relativity to this level)" "Order or value of terms" flat, empty spacetime -1 g_(00) 0 Newtonian approximation 2U 0 g_(jk) "post-Newtonian approximation [(Fock (1959); Chandrasekhar (1965a)]" + terms ∼epsilon^(4) + terms ∼epsilon^(3) + terms ∼epsilon^(2) "post-post-Newtonian approximation [Chandrasekhar and Nutku (1969)]" + terms ∼epsilon^(6) + terms ∼epsilon^(5) + terms ∼epsilon^(4) "radiation damping [Chandrasekhar and Esposito (1970)]" + terms ∼epsilon^(7) + terms ∼epsilon^(6) + terms ∼epsilon^(5)| Level of approximation <br> (and papers expanding general <br> relativity to this level) | Order or value of terms | | | | :--- | :---: | :---: | :---: | | flat, empty spacetime | -1 | $g_{00}$ | 0 | | Newtonian approximation | $2 U$ | 0 | $g_{j k}$ | | post-Newtonian approximation <br> [(Fock (1959); Chandrasekhar (1965a)] | + terms $\sim \epsilon^{4}$ | + terms $\sim \epsilon^{3}$ | + terms $\sim \epsilon^{2}$ | | post-post-Newtonian approximation <br> [Chandrasekhar and Nutku (1969)] | + terms $\sim \epsilon^{6}$ | + terms $\sim \epsilon^{5}$ | + terms $\sim \epsilon^{4}$ | | radiation damping <br> [Chandrasekhar and Esposito (1970)] | + terms $\sim \epsilon^{7}$ | + terms $\sim \epsilon^{6}$ | + terms $\sim \epsilon^{5}$ |
(2) The law of local energy-momentum conservation, T α β ; β = 0 T α β ; β = 0 T^(alpha beta)_(;beta)=0T^{\alpha \beta}{ }_{; \beta}=0Tαβ;β=0. The time component of this law reduces to the conservation of rest mass
(39.15a) ρ o / t + ( ρ o v j ) / x j = 0 + fractional errors of O ( ϵ 2 ) ; (39.15a) ρ o / t + ρ o v j / x j = 0 +  fractional errors of  O ϵ 2 ; {:(39.15a)delrho_(o)//del t+del(rho_(o)v_(j))//delx_(j)=0+" fractional errors of "O(epsilon^(2));:}\begin{equation*} \partial \rho_{o} / \partial t+\partial\left(\rho_{o} v_{j}\right) / \partial x_{j}=0+\text { fractional errors of } O\left(\epsilon^{2}\right) ; \tag{39.15a} \end{equation*}(39.15a)ρo/t+(ρovj)/xj=0+ fractional errors of O(ϵ2);
and the space components reduce to Newton's second law of motion, " F = m a F = m a F=maF=m aF=ma ":
(39.15b) ρ o d v j / d t = ρ o ( U / x j ) t j k / x k + fractional errors of O ( ϵ 2 ) , (39.16) d / d t (time derivative following the matter) / t + v k / x k (39.15b) ρ o d v j / d t = ρ o U / x j t j k / x k +  fractional errors of  O ϵ 2 , (39.16) d / d t  (time derivative following the matter)  / t + v k / x k {:[(39.15b)rho_(o)dv_(j)//dt=rho_(o)(del U//delx_(j))-delt_(jk)//delx_(k)+" fractional errors of "O(epsilon^(2))","],[(39.16)d//dt-=" (time derivative following the matter) "-=del//del t+v_(k)del//delx_(k)]:}\begin{gather*} \rho_{o} d v_{j} / d t=\rho_{o}\left(\partial U / \partial x_{j}\right)-\partial t_{j k} / \partial x_{k}+\text { fractional errors of } O\left(\epsilon^{2}\right), \tag{39.15b}\\ d / d t \equiv \text { (time derivative following the matter) } \equiv \partial / \partial t+v_{k} \partial / \partial x_{k} \tag{39.16} \end{gather*}(39.15b)ρodvj/dt=ρo(U/xj)tjk/xk+ fractional errors of O(ϵ2),(39.16)d/dt (time derivative following the matter) /t+vk/xk
(see exercise 39.3).
Equations (39.14)-(39.16), together with equations of state describing the planetary and solar matter, are the foundations for all Newtonian calculations of the structure and motion of the sun and planets. Notice that the internal energy density ρ 0 Π ρ 0 Π rho_(0)Pi\rho_{0} \Piρ0Π nowhere enters into these equations. It is of no importance to Newtonian hydrodynamics. It matters for the sun's thermal-energy balance; but that is irrelevant here.

EXERCISES

Exercise 39.3. NEWTONIAN APPROXIMATION

(a) Derive equations (39.13) for the components of the stress-energy tensor in the PPN coordinate frame. [Hint: In the rest frame of the matter ("comoving orthonormal frame") T 0 ^ 0 ^ = ρ = ρ o + O ( ϵ 2 ) , T 0 ^ j ^ = 0 , T j ^ k ^ = t j ^ k T 0 ^ 0 ^ = ρ = ρ o + O ϵ 2 , T 0 ^ j ^ = 0 , T j ^ k ^ = t j ^ k T_( hat(0) hat(0))=rho=rho_(o)+O(epsilon^(2)),T_( hat(0) hat(j))=0,T_( hat(j) hat(k))=t_( hat(j)k)T_{\hat{0} \hat{0}}=\rho=\rho_{o}+O\left(\epsilon^{2}\right), T_{\hat{0} \hat{j}}=0, T_{\hat{j} \hat{k}}=t_{\hat{j} k}T0^0^=ρ=ρo+O(ϵ2),T0^j^=0,Tj^k^=tj^k; see equations (39.5). Lorentz-transform these components by a pure boost with ordinary velocity v j v j -v_(j)-v_{j}vj to obtain T α β T α β T_(alpha beta)T_{\alpha \beta}Tαβ.]
(b) Show that, in the PPN coordinate frame, T 0 α ; α = 0 T 0 α ; α = 0 T^(0alpha)_(;alpha)=0T^{0 \alpha}{ }_{; \alpha}=0T0α;α=0 reduces to equation (39.15a), and T ; α j α = 0 T ; α j α = 0 T_(;alpha)^(j alpha)=0T_{; \alpha}^{j \alpha}=0T;αjα=0, when combined with (39.15a), reduces to equation (39.15b).]

Exercise 39.4. A USEFUL FORMULA

Derive from equations (39.15) the following useful formula, valid for any function f ( x , t ) f ( x , t ) f(x,t)f(x, t)f(x,t) :
(39.17) d d t ρ o ( x , t ) f ( x , t ) d 3 x = ρ o ( x , t ) d f ( x , t ) d t d 3 x + fractional errors of O ( ϵ 2 ) . (39.17) d d t ρ o ( x , t ) f ( x , t ) d 3 x = ρ o ( x , t ) d f ( x , t ) d t d 3 x +  fractional errors of  O ϵ 2 . {:[(39.17)(d)/(dt)intrho_(o)(x","t)f(x","t)d^(3)x= intrho_(o)(x","t)(df(x,t))/(dt)d^(3)x],[+" fractional errors of "O(epsilon^(2)).]:}\begin{align*} \frac{d}{d t} \int \rho_{o}(x, t) f(x, t) d^{3} x= & \int \rho_{o}(x, t) \frac{d f(x, t)}{d t} d^{3} x \tag{39.17}\\ & + \text { fractional errors of } O\left(\epsilon^{2}\right) . \end{align*}(39.17)ddtρo(x,t)f(x,t)d3x=ρo(x,t)df(x,t)dtd3x+ fractional errors of O(ϵ2).
Here both integrals are extended over all of space; and d f / d t d f / d t df//dtd f / d tdf/dt is the derivative following the matter (39.16).

Exercise 39.5. STRESS TENSOR FOR NEWTONIAN GRAVITATIONAL FIELD

Define a "stress tensor for the Newtonian gravitational field U U UUU " as follows:
(39.18) t j k 1 4 π ( U , j U , k 1 2 δ j k U , l U , l ) . (39.18) t j k 1 4 π U , j U , k 1 2 δ j k U , l U , l . {:(39.18)t_(jk)-=(1)/(4pi)(U_(,j)U_(,k)-(1)/(2)delta_(jk)U_(,l)U_(,l)).:}\begin{equation*} \mathrm{t}_{j k} \equiv \frac{1}{4 \pi}\left(U_{, j} U_{, k}-\frac{1}{2} \delta_{j k} U_{, l} U_{, l}\right) . \tag{39.18} \end{equation*}(39.18)tjk14π(U,jU,k12δjkU,lU,l).
Show that the equations of motion for the matter (39.15b) can be rewritten in the forms
(39.19) ρ o d v j d t = x k ( t j k + t j k ) + fractional errors of O ( ϵ 2 ) , (39.19') ( ρ o v j ) , t + ( t j k + t j k + ρ o v j v k ) , k = 0 + fractional errors of O ( ϵ 2 ) . (39.19) ρ o d v j d t = x k t j k + t j k +  fractional errors of  O ϵ 2 , (39.19') ρ o v j , t + t j k + t j k + ρ o v j v k , k = 0 +  fractional errors of  O ϵ 2 . {:[(39.19)rho_(o)(dv_(j))/(dt)=-(del)/(delx^(k))(t_(jk)+t_(jk))+" fractional errors of "O(epsilon^(2))","],[(39.19')(rho_(o)v_(j))_(,t)+(t_(jk)+t_(jk)+rho_(o)v_(j)v_(k))_(,k)=0+" fractional errors of "O(epsilon^(2)).]:}\begin{gather*} \rho_{o} \frac{d v_{j}}{d t}=-\frac{\partial}{\partial x^{k}}\left(\mathrm{t}_{j k}+t_{j k}\right)+\text { fractional errors of } O\left(\epsilon^{2}\right), \tag{39.19}\\ \left(\rho_{o} v_{j}\right)_{, t}+\left(\mathrm{t}_{j k}+t_{j k}+\rho_{o} v_{j} v_{k}\right)_{, k}=0+\text { fractional errors of } O\left(\epsilon^{2}\right) . \tag{39.19'} \end{gather*}(39.19)ρodvjdt=xk(tjk+tjk)+ fractional errors of O(ϵ2),(39.19')(ρovj),t+(tjk+tjk+ρovjvk),k=0+ fractional errors of O(ϵ2).

Exercise 39.6. NEWTONIAN VIRIAL THEOREMS

(a) From equation ( 39.19 ) 39.19 (39.19^('))\left(39.19^{\prime}\right)(39.19) show that
(39.20a) d 2 I j k / d t 2 = 2 ( t j k + t j k + ρ o v j v k ) d 3 x + fractional errors of O ( ϵ 2 ) (39.20a) d 2 I j k / d t 2 = 2 t j k + t j k + ρ o v j v k d 3 x +  fractional errors of  O ϵ 2 {:(39.20a)d^(2)I_(jk)//dt^(2)=2int(t_(jk)+t_(jk)+rho_(o)v_(j)v_(k))d^(3)x+" fractional errors of "O(epsilon^(2)):}\begin{equation*} d^{2} I_{j k} / d t^{2}=2 \int\left(\mathrm{t}_{j k}+t_{j k}+\rho_{o} v_{j} v_{k}\right) d^{3} x+\text { fractional errors of } O\left(\epsilon^{2}\right) \tag{39.20a} \end{equation*}(39.20a)d2Ijk/dt2=2(tjk+tjk+ρovjvk)d3x+ fractional errors of O(ϵ2)
where I j k I j k I_(jk)I_{j k}Ijk is the second moment of the system's mass distribution,
I j k = ρ o x j x k d 3 x I j k = ρ o x j x k d 3 x I_(jk)=intrho_(o)x_(j)x_(k)d^(3)xI_{j k}=\int \rho_{o} x_{j} x_{k} d^{3} xIjk=ρoxjxkd3x
This is called the "time-dependent tensor virial theorem."
(b) From this infer that, if longtime longtime  (:quad:)_("longtime ")\langle\quad\rangle_{\text {longtime }}longtime  denotes an average over a long period of time, then
(39.20b) ( t j k + t 3 k ^ + ρ o v j v k ) d 3 x long time = O ( ρ o ϵ 4 d 3 x ) . (39.20b) t j k + t 3 k ^ + ρ o v j v k d 3 x long time  = O ρ o ϵ 4 d 3 x . {:(39.20b)(:int(t_(jk)+t_(3 hat(k))+rho_(o)v_(j)v_(k))d^(3)x:)_("long time ")=O(intrho_(o)epsilon^(4)d^(3)x).:}\begin{equation*} \left\langle\int\left(\mathrm{t}_{j k}+t_{3 \hat{k}}+\rho_{o} v_{j} v_{k}\right) d^{3} x\right\rangle_{\text {long time }}=O\left(\int \rho_{o} \epsilon^{4} d^{3} x\right) . \tag{39.20b} \end{equation*}(39.20b)(tjk+t3k^+ρovjvk)d3xlong time =O(ρoϵ4d3x).
This is called the "tensor virial theorem."
(c) By contraction of indices and use of equations (39.18), (39.14a), and (39.5e), derive the (ordinary) virial theorems:
1 2 d 2 I / d t 2 = ρ o v 2 d 3 x 1 2 ρ o U d 3 x + 3 p d 3 x + 1 2 d 2 I / d t 2 = ρ o v 2 d 3 x 1 2 ρ o U d 3 x + 3 p d 3 x + (1)/(2)d^(2)I//dt^(2)=intrho_(o)v^(2)d^(3)x-int(1)/(2)rho_(o)Ud^(3)x+3int pd^(3)x+\frac{1}{2} d^{2} I / d t^{2}=\int \rho_{o} v^{2} d^{3} x-\int \frac{1}{2} \rho_{o} U d^{3} x+3 \int p d^{3} x+12d2I/dt2=ρov2d3x12ρoUd3x+3pd3x+ fractional errors of O ( ϵ 2 ) O ϵ 2 O(epsilon^(2))O\left(\epsilon^{2}\right)O(ϵ2),
where I I III is the trace of the second moment of the mass distribution
I = I j j = ρ o r 2 d 3 x I = I j j = ρ o r 2 d 3 x I=I_(jj)=intrho_(o)r^(2)d^(3)xI=I_{j j}=\int \rho_{o} r^{2} d^{3} xI=Ijj=ρor2d3x
and
(39.2lb) ρ 0 v 2 d 3 x 1 2 ρ 0 U d 3 x kinetic + 3 p d 3 x ( gravitational energy ) + 3 × ( pressure integral ) longtime = O ( ρ 0 ϵ 4 d 3 x ) 2 × ( (39.2lb) ρ 0 v 2 d 3 x 1 2 ρ 0 U d 3 x kinetic  + 3 p d 3 x ( gravitational   energy  ) + 3 × (  pressure   integral  ) longtime  = O ρ 0 ϵ 4 d 3 x 2 × ( {:[(39.2lb)(:ubrace(intrho_(0)v^(2)d^(3)xubrace)-ubrace((1)/(2)intrho_(0)Ud^(3)xubrace)_("kinetic ")+ubrace(3int pd^(3)xubrace)_((("gravitational ")/(" energy "))+3xx((" pressure ")/(" integral "))):)_("longtime ")=O(intrho_(0)epsilon^(4)d^(3)x)],[2xx(]:}\begin{align*} & \langle\underbrace{\int \rho_{0} v^{2} d^{3} x}-\underbrace{\frac{1}{2} \int \rho_{0} U d^{3} x}_{\text {kinetic }}+\underbrace{3 \int p d^{3} x}_{\binom{\text {gravitational }}{\text { energy }}+3 \times\binom{\text { pressure }}{\text { integral }}}\rangle_{\text {longtime }}=O\left(\int \rho_{0} \epsilon^{4} d^{3} x\right) \tag{39.2lb}\\ & 2 \times( \end{align*}(39.2lb)ρ0v2d3x12ρ0Ud3xkinetic +3pd3x(gravitational  energy )+3×( pressure  integral )longtime =O(ρ0ϵ4d3x)2×(

Exercise 39.7. PULSATION FREQUENCY FOR NEWTONIAN STAR

Use the ordinary, time-dependent virial theorem (39.21a) to derive the following equation for the fundamental angular frequency of pulsation of a nonrotating, Newtonian star:
(39.22a) ω 2 = ( 3 Γ ¯ 1 4 ) star's self-gravitational energy ( trace of second moment of star's mass distribution ) (39.22b) Γ ¯ 1 = ( pressure-weighted average of adiabatic index ) Γ 1 p d 3 x p d 3 x (39.22a) ω 2 = 3 Γ ¯ 1 4  star's self-gravitational energy  (  trace of second moment of star's mass distribution  ) (39.22b) Γ ¯ 1 = (  pressure-weighted average   of adiabatic index  ) Γ 1 p d 3 x p d 3 x {:[(39.22a)omega^(2)=(3 bar(Gamma)_(1)-4)(∣" star's self-gravitational energy "∣)/((" trace of second moment of star's mass distribution "))],[(39.22b) bar(Gamma)_(1)=((" pressure-weighted average ")/(" of adiabatic index "))-=(intGamma_(1)pd^(3)x)/(int pd^(3)x)]:}\begin{gather*} \omega^{2}=\left(3 \bar{\Gamma}_{1}-4\right) \frac{\mid \text { star's self-gravitational energy } \mid}{(\text { trace of second moment of star's mass distribution })} \tag{39.22a}\\ \bar{\Gamma}_{1}=\binom{\text { pressure-weighted average }}{\text { of adiabatic index }} \equiv \frac{\int \Gamma_{1} p d^{3} x}{\int p d^{3} x} \tag{39.22b} \end{gather*}(39.22a)ω2=(3Γ¯14) star's self-gravitational energy ( trace of second moment of star's mass distribution )(39.22b)Γ¯1=( pressure-weighted average  of adiabatic index )Γ1pd3xpd3x
In the derivation assume that the pulsations are "homologous"-i.e., that a fluid element with equilibrium position x j x j x^(j)x^{j}xj (relative to center of mass x j = 0 x j = 0 x^(j)=0x^{j}=0xj=0 ) gets displaced to x j + ξ j ( x , t ) x j + ξ j ( x , t ) x^(j)+xi^(j)(x,t)x^{j}+\xi^{j}(x, t)xj+ξj(x,t), where
ξ j = ( small constant ) x j e i ω t . ξ j = (  small constant  ) x j e i ω t . xi^(j)=(" small constant ")x^(j)e^(-i omega t).\xi^{j}=(\text { small constant }) x^{j} e^{-i \omega t} .ξj=( small constant )xjeiωt.
Assume nothing else. Notes: (1) The result (39.22) was derived differently in Box 26.2 and used in $ 24.4 $ 24.4 $24.4\$ 24.4$24.4. (2) The assumption of homologous pulsation is fully justified if | Γ 1 4 / 3 | = Γ 1 4 / 3 = |Gamma_(1)-4//3|=\left|\Gamma_{1}-4 / 3\right|=|Γ14/3|= constant 1 1 ≪1\ll 11; see Box 26.2 . (3) The result (39.22) is readily generalized to slowly
rotating Newtonian stars; see, e.g., Chandrasekhar and Lebovitz (1968). It can also be generalized to nonrotating post-Newtonian stars using general relativity (Box 26.2), or using the PPN formalism for any metric theory [ Ni (1973)]. And it can be generalized to slowly rotating, post-Newtonian stars [see, e.g., Chandrasekhar and Lebovitz (1968)].

§39.8. PPN METRIC COEFFICIENTS

Post-Newtonian corrections to metric:
(1) rules governing forms
(2) construction of corrections
The post-Newtonian corrections k α β k α β k_(alpha beta)k_{\alpha \beta}kαβ to the metric coefficients g α β g α β g_(alpha beta)g_{\alpha \beta}gαβ are calculated, in any metric theory of gravity, by lengthy manipulations of the field equations. (See, e.g., exercise 39.14 near the end of this chapter for general relativity.) But without ever picking some one theory, and without ever writing down any set of field equations, one can infer the forms of the post-Newtonian corrections k α β k α β k_(alpha beta)k_{\alpha \beta}kαβ. Their forms are fixed by the following constraints: (1) They must be of post-Newtonian order ( k 00 ϵ 4 , k 0 j ϵ 3 , k i j ϵ 2 k 00 ϵ 4 , k 0 j ϵ 3 , k i j ϵ 2 k_(00)∼epsilon^(4),k_(0j)∼epsilon^(3),k_(ij)∼epsilon^(2)k_{00} \sim \boldsymbol{\epsilon}^{4}, k_{0 j} \sim \epsilon^{3}, k_{i j} \sim \boldsymbol{\epsilon}^{2}k00ϵ4,k0jϵ3,kijϵ2 ). (2) They must be dimensionless. (3) k 00 k 00 k_(00)k_{00}k00 must be a scalar under rotations, k 0 j k 0 j k_(0j)k_{0 j}k0j must be components of a 3 -vector, and k j k k j k k_(jk)k_{j k}kjk must be components of a 3-tensor. (4) The corrections must die out at least as fast as 1 / r 1 / r 1//r1 / r1/r far from the solar system, so that the coordinates become globally Lorentz and spacetime becomes flat at r = r = r=oor=\inftyr=. (5) For simplicity, one can assume that the metric components are generated only by ρ o , ρ o Π , t i j ^ , p ρ o , ρ o Π , t i j ^ , p rho_(o),rho_(o)Pi,t_(i hat(j)),p\rho_{o}, \rho_{o} \Pi, t_{i \hat{j}}, pρo,ρoΠ,tij^,p, products of these with the velocity v j v j v_(j)v_{j}vj, and time-derivatives of such quantities;* but not by their spatial gradients. [This assumption of simplicity is satisfied by all metric theories examined up to 1973, except Whitehead (1922) and theories reviewed by Will (1973)—which disagree with experiment.] Note the further justification for this assumption in exercise 39.8.
Begin with the corrections to the spatial components, k i j ϵ 2 k i j ϵ 2 k_(ij)∼epsilon^(2)k_{i j} \sim \epsilon^{2}kijϵ2. There are only two functionals of ρ o , p , Π , t j k , v j ρ o , p , Π , t j k , v j rho_(o),p,Pi,t_(jk),v_(j)\rho_{o}, p, \Pi, t_{j k}, v_{j}ρo,p,Π,tjk,vj, that die out at least as fast as 1 / r 1 / r 1//r1 / r1/r, are dimensionless, are O ( ϵ 2 ) O ϵ 2 O(epsilon^(2))O\left(\epsilon^{2}\right)O(ϵ2), and are second-rank, symmetric 3-tensors; they are
(39.23a) δ i j U ( x , t ) ; U i j ( x , t ) = ρ o ( x , t ) ( x i x i ) ( x j x j ) | x x | 3 d 3 x . (39.23a) δ i j U ( x , t ) ; U i j ( x , t ) = ρ o x , t x i x i x j x j x x 3 d 3 x . {:(39.23a)delta_(ij)U(x","t);quadU_(ij)(x","t)=int(rho_(o)(x^('),t)(x_(i)-x_(i)^('))(x_(j)-x_(j)^(')))/(|x-x^(')|^(3))d^(3)x^(').:}\begin{equation*} \delta_{i j} U(\boldsymbol{x}, t) ; \quad U_{i j}(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)\left(x_{i}-x_{i}{ }^{\prime}\right)\left(x_{j}-x_{j}{ }^{\prime}\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|^{3}} d^{3} x^{\prime} . \tag{39.23a} \end{equation*}(39.23a)δijU(x,t);Uij(x,t)=ρo(x,t)(xixi)(xjxj)|xx|3d3x.
Thus, k i j k i j k_(ij)k_{i j}kij must be k i j = 2 γ δ i j U + 2 Γ U i j k i j = 2 γ δ i j U + 2 Γ U i j k_(ij)=2gammadelta_(ij)U+2GammaU_(ij)k_{i j}=2 \gamma \delta_{i j} U+2 \Gamma U_{i j}kij=2γδijU+2ΓUij, for some constant "PPN parameters" γ γ gamma\gammaγ and Γ Γ Gamma\GammaΓ. By an infinitesimal coordinate transformation [ x i NEW = x i OLD + Γ χ / x i x i NEW = x i OLD + Γ χ / x i [x_(iNEW)=x_(iOLD)+Gamma del chi//delx_(i):}\left[x_{i \mathrm{NEW}}=x_{i \mathrm{OLD}}+\Gamma \partial \chi / \partial x_{i}\right.[xiNEW=xiOLD+Γχ/xi, with χ ( x , t ) = ρ o ( x , t ) | x x | d 3 x ] χ ( x , t ) = ρ o x , t x x d 3 x {: chi(x,t)=-intrho_(o)(x^('),t)|x-x^(')|d^(3)x^(')]\left.\chi(\boldsymbol{x}, t)=-\int \rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right| d^{3} x^{\prime}\right]χ(x,t)=ρo(x,t)|xx|d3x] one can set Γ = 0 Γ = 0 Gamma=0\Gamma=0Γ=0, thereby obtaining
(39.23b) g i j = δ i j + k i j = δ i j ( 1 + 2 γ U ) + O ( ϵ 4 ) (39.23b) g i j = δ i j + k i j = δ i j ( 1 + 2 γ U ) + O ϵ 4 {:(39.23b)g_(ij)=delta_(ij)+k_(ij)=delta_(ij)(1+2gamma U)+O(epsilon^(4)):}\begin{equation*} g_{i j}=\delta_{i j}+k_{i j}=\delta_{i j}(1+2 \gamma U)+O\left(\epsilon^{4}\right) \tag{39.23b} \end{equation*}(39.23b)gij=δij+kij=δij(1+2γU)+O(ϵ4)
*One allows for time derivatives because retarded integrals contain such terms when expanded to post-Newtonian order; thus,
ρ o ( x , t | x x | ) | x x | d 3 x = [ ρ o ( x , t ) | x x | ρ o ( x , t ) t + ] d 3 x ρ o x , t x x x x d 3 x = ρ o x , t x x ρ o x , t t + d 3 x int(rho_(o)(x^('),t-|x-x^(')|))/(|x-x^(')|)d^(3)x^(')=int[(rho_(o)(x^('),t))/(|x-x^(')|)-(delrho_(o)(x^('),t))/(del t)+cdots]d^(3)x^(')\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t-\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime}=\int\left[\frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|}-\frac{\partial \rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)}{\partial t}+\cdots\right] d^{3} x^{\prime}ρo(x,t|xx|)|xx|d3x=[ρo(x,t)|xx|ρo(x,t)t+]d3x
However, it turns out that, with a suitable choice of coordinates ("gauge"), all time-odd retarded terms [e.g., ( ρ o / t ) d 3 x ρ o / t d 3 x int(delrho_(o)//del t)d^(3)x\int\left(\partial \rho_{o} / \partial t\right) d^{3} x(ρo/t)d3x ] vanish, except at "the post 5 / 2 5 / 2 ^(5//2){ }^{5 / 2}5/2-Newtonian order" and at higher orders of approximation; there they lead to radiation damping (see Box 39.3). For example, ( ρ o / t ) d 3 x = ( d / d t ) ρ o d 3 x ρ o / t d 3 x = ( d / d t ) ρ o d 3 x int(delrho_(o)//del t)d^(3)x=(d//dt)intrho_(o)d^(3)x\int\left(\partial \rho_{o} / \partial t\right) d^{3} x=(d / d t) \int \rho_{o} d^{3} x(ρo/t)d3x=(d/dt)ρod3x vanishes by virtue of the conservation of baryon number.
Next consider k 0 j ϵ 3 k 0 j ϵ 3 k_(0j)∼epsilon^(3)k_{0 j} \sim \epsilon^{3}k0jϵ3. Trial and error yield only two vector functionals that die out as 1 / r 1 / r 1//r1 / r1/r or faster, are dimensionless, and are O ( ϵ 3 ) O ϵ 3 O(epsilon^(3))O\left(\epsilon^{3}\right)O(ϵ3). They are
(39.23c) V j ( x , t ) = ρ o ( x , t ) v j ( x , t ) | x x | d 3 x , (39.23d) W j ( x , t ) = ρ o ( x , t ) [ ( x x ) v ( x , t ) ] ( x j x j ) d 3 x | x x | 3 (39.23c) V j ( x , t ) = ρ o x , t v j x , t x x d 3 x , (39.23d) W j ( x , t ) = ρ o x , t x x v x , t x j x j d 3 x x x 3 {:[(39.23c)V_(j)(x","t)=int(rho_(o)(x^('),t)v_(j)(x^('),t))/(|x-x^(')|)d^(3)x^(')","],[(39.23d)W_(j)(x","t)=int(rho_(o)(x^('),t)[(x-x^('))*v(x^('),t)](x_(j)-x_(j)^('))d^(3)x^('))/(|x-x^(')|^(3))]:}\begin{gather*} V_{j}(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right) v_{j}\left(\boldsymbol{x}^{\prime}, t\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime}, \tag{39.23c}\\ W_{j}(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)\left[\left(\boldsymbol{x}-\boldsymbol{x}^{\prime}\right) \cdot \boldsymbol{v}\left(\boldsymbol{x}^{\prime}, t\right)\right]\left(x_{j}-x_{j}^{\prime}\right) d^{3} x^{\prime}}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|^{3}} \tag{39.23d} \end{gather*}(39.23c)Vj(x,t)=ρo(x,t)vj(x,t)|xx|d3x,(39.23d)Wj(x,t)=ρo(x,t)[(xx)v(x,t)](xjxj)d3x|xx|3
Thus, k 0 j k 0 j k_(0j)k_{0 j}k0j must be a linear combination of these, involving unknown constants (PPN parameters) Δ 1 Δ 1 Delta_(1)\Delta_{1}Δ1 and Δ 2 Δ 2 Delta_(2)\Delta_{2}Δ2 :
(39.23e) g 0 j = k 0 j = 7 2 Δ 1 V j 1 2 Δ 2 W j + O ( ϵ 5 ) (39.23e) g 0 j = k 0 j = 7 2 Δ 1 V j 1 2 Δ 2 W j + O ϵ 5 {:(39.23e)g_(0j)=k_(0j)=-(7)/(2)Delta_(1)V_(j)-(1)/(2)Delta_(2)W_(j)+O(epsilon^(5)):}\begin{equation*} g_{0 j}=k_{0 j}=-\frac{7}{2} \Delta_{1} V_{j}-\frac{1}{2} \Delta_{2} W_{j}+O\left(\epsilon^{5}\right) \tag{39.23e} \end{equation*}(39.23e)g0j=k0j=72Δ1Vj12Δ2Wj+O(ϵ5)
Finally consider k 00 ϵ 4 k 00 ϵ 4 k_(00)∼epsilon^(4)k_{00} \sim \epsilon^{4}k00ϵ4. Trial and error yields a variety of terms, which can all be combined together with the Newtonian part of g 00 g 00 g_(00)g_{00}g00 to give
(39.23f) g 00 = 1 + 2 U + k 00 = 1 + 2 U 2 β U 2 + 4 Ψ ζ a η D , (39.23f) g 00 = 1 + 2 U + k 00 = 1 + 2 U 2 β U 2 + 4 Ψ ζ a η D , {:(39.23f)g_(00)=-1+2U+k_(00)=-1+2U-2betaU^(2)+4Psi-zeta a-etaD",":}\begin{equation*} g_{00}=-1+2 U+k_{00}=-1+2 U-2 \beta U^{2}+4 \Psi-\zeta a-\eta \mathscr{D}, \tag{39.23f} \end{equation*}(39.23f)g00=1+2U+k00=1+2U2βU2+4ΨζaηD,
where
(39.23~g) Ψ ( x , t ) = ρ o ( x , t ) ψ ( x , t ) | x x | d 3 x , ψ = β 1 v 2 + β 2 U + 1 2 β 3 Π + 3 2 β 4 p / ρ o , (39.23h) a ( x , t ) = ρ o ( x , t ) [ ( x x ) v ( x , t ) ] 2 | x x | 3 d 3 x , (39.23i) D ( x , t ) = [ t 3 k ( x , t ) 1 3 δ j k t ( x , t ) ] ( x j x j ) ( x k x k ) | x x | 3 d 3 x . (39.23~g) Ψ ( x , t ) = ρ o x , t ψ x , t x x d 3 x , ψ = β 1 v 2 + β 2 U + 1 2 β 3 Π + 3 2 β 4 p / ρ o , (39.23h) a ( x , t ) = ρ o x , t x x v x , t 2 x x 3 d 3 x , (39.23i) D ( x , t ) = t 3 k x , t 1 3 δ j k t x , t x j x j x k x k x x 3 d 3 x . {:[(39.23~g)Psi(x","t)=int(rho_(o)(x^('),t)psi(x^('),t))/(|x-x^(')|)d^(3)x^(')","],[psi=beta_(1)v^(2)+beta_(2)U+(1)/(2)beta_(3)Pi+(3)/(2)beta_(4)p//rho_(o)","],[(39.23h)a(x","t)=int(rho_(o)(x^('),t)[(x-x^('))*v(x^('),t)]^(2))/(|x-x^(')|^(3))d^(3)x^(')","],[(39.23i)D(x","t)=int([t_(3k)(x^('),t)-(1)/(3)delta_(jk)t_(ℓℓ)(x^('),t)](x_(j)-x_(j)^('))(x_(k)-x_(k)))/(|x-x^(')|^(3))d^(3)x^(').]:}\begin{gather*} \Psi(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right) \psi\left(\boldsymbol{x}^{\prime}, t\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime}, \tag{39.23~g}\\ \psi=\beta_{1} \boldsymbol{v}^{2}+\beta_{2} U+\frac{1}{2} \beta_{3} \Pi+\frac{3}{2} \beta_{4} p / \rho_{o}, \\ a(x, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)\left[\left(\boldsymbol{x}-\boldsymbol{x}^{\prime}\right) \cdot \boldsymbol{v}\left(\boldsymbol{x}^{\prime}, t\right)\right]^{2}}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|^{3}} d^{3} x^{\prime}, \tag{39.23h}\\ \mathscr{D}(\boldsymbol{x}, t)=\int \frac{\left[t_{3 k}\left(\boldsymbol{x}^{\prime}, t\right)-\frac{1}{3} \delta_{j k} t_{\ell \ell}\left(\boldsymbol{x}^{\prime}, t\right)\right]\left(x_{j}-x_{j}^{\prime}\right)\left(x_{k}-x_{k}\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|^{3}} d^{3} x^{\prime} . \tag{39.23i} \end{gather*}(39.23~g)Ψ(x,t)=ρo(x,t)ψ(x,t)|xx|d3x,ψ=β1v2+β2U+12β3Π+32β4p/ρo,(39.23h)a(x,t)=ρo(x,t)[(xx)v(x,t)]2|xx|3d3x,(39.23i)D(x,t)=[t3k(x,t)13δjkt(x,t)](xjxj)(xkxk)|xx|3d3x.
Also, β , β 1 , β 2 , β 3 , β 4 , ζ , η β , β 1 , β 2 , β 3 , β 4 , ζ , η beta,beta_(1),beta_(2),beta_(3),beta_(4),zeta,eta\beta, \beta_{1}, \beta_{2}, \beta_{3}, \beta_{4}, \zeta, \etaβ,β1,β2,β3,β4,ζ,η are unknown constants (PPN parameters). Elsewhere in the literature the term η Q η Q -eta^(Q)-\eta^{\mathscr{Q}}ηQ in g 00 g 00 g_(00)g_{00}g00 is ignored (see footnote on p . 1075).
Yet another term is possible: one could have set
g 00 = [ value in equation ( 39.23 f ) ] (39.24) Σ ρ o ( x , t ) ρ o ( x , t ) [ ( x x ) ( x x ) ] | x x | | x x | 3 d 3 x d 3 x g 00 = [  value in equation  ( 39.23 f ) ] (39.24) Σ ρ o x , t ρ o x , t x x x x x x x x 3 d 3 x d 3 x {:[g_(00)=[" value in equation "(39.23f)]],[(39.24)-Sigma∬(rho_(o)(x^('),t)rho_(o)(x^(''),t)[(x-x^('))*(x^(')-x^(''))])/(|x-x^(')||x^(')-x^('')|^(3))d^(3)x^(')d^(3)x^('')]:}\begin{align*} g_{00}= & {[\text { value in equation }(39.23 \mathrm{f})] } \\ & -\Sigma \iint \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right) \rho_{o}\left(\boldsymbol{x}^{\prime \prime}, t\right)\left[\left(\boldsymbol{x}-\boldsymbol{x}^{\prime}\right) \cdot\left(\boldsymbol{x}^{\prime}-\boldsymbol{x}^{\prime \prime}\right)\right]}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|\left|\boldsymbol{x}^{\prime}-\boldsymbol{x}^{\prime \prime}\right|^{3}} d^{3} x^{\prime} d^{3} x^{\prime \prime} \tag{39.24} \end{align*}g00=[ value in equation (39.23f)](39.24)Σρo(x,t)ρo(x,t)[(xx)(xx)]|xx||xx|3d3xd3x
where Σ Σ Sigma\SigmaΣ is another PPN parameter. [It can be shown, using the Newtonian equations (39.14)-(39.16), that this expression dies out as 1 / r 1 / r 1//r1 / r1/r far from the solar system.] If
Rigidity of coordinate system
Summary of PPN metric and parameters
such a Σ Σ Sigma\SigmaΣ term had been included, then one could have removed it by making the infinitesimal coordinate transformation
(39.25) x new 0 = x old 0 1 2 Σ ρ o ( x , t ) [ ( x x ) v ( x , t ) ] | x x | d 3 x (39.25) x new  0 = x old  0 1 2 Σ ρ o x , t x x v x , t x x d 3 x {:(39.25)x_("new ")^(0)=x_("old ")^(0)-(1)/(2)Sigma int(rho_(o)(x^('),t)[(x-x^('))*v(x^('),t)])/(|x-x^(')|)d^(3)x^('):}\begin{equation*} x_{\text {new }}^{0}=x_{\text {old }}^{0}-\frac{1}{2} \Sigma \int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)\left[\left(\boldsymbol{x}-\boldsymbol{x}^{\prime}\right) \cdot \boldsymbol{v}\left(\boldsymbol{x}^{\prime}, t\right)\right]}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime} \tag{39.25} \end{equation*}(39.25)xnew 0=xold 012Σρo(x,t)[(xx)v(x,t)]|xx|d3x
(see exercise 39.9). Thus, there is no necessity to include the Σ Σ Sigma\SigmaΣ term.
The absence of the Σ Σ Sigma\SigmaΣ term from g 00 g 00 g_(00)g_{00}g00 means that the time coordinate has been fixed rigidly up through post-Newtonian order:
(39.26a) x 0 has uncertainties only of O ( R ϵ 5 ) 10 14 seconds. (39.26a) x 0  has uncertainties only of  O R ϵ 5 10 14  seconds.  {:(39.26a)x^(0)" has uncertainties only of "O(R_(o.)epsilon^(5))∼10^(-14)" seconds. ":}\begin{equation*} x^{0} \text { has uncertainties only of } O\left(R_{\odot} \epsilon^{5}\right) \sim 10^{-14} \text { seconds. } \tag{39.26a} \end{equation*}(39.26a)x0 has uncertainties only of O(Rϵ5)1014 seconds. 
The space coordinates are also fixed rigidly through post-Newtonian order:
(39.26b) x j has uncertainties only of O ( R ϵ 4 ) 0.1 cm , (39.26b) x j  has uncertainties only of  O R ϵ 4 0.1 cm , {:(39.26b)x^(j)" has uncertainties only of "O(R_(o.)epsilon^(4))∼0.1cm",":}\begin{equation*} x^{j} \text { has uncertainties only of } O\left(R_{\odot} \epsilon^{4}\right) \sim 0.1 \mathrm{~cm}, \tag{39.26b} \end{equation*}(39.26b)xj has uncertainties only of O(Rϵ4)0.1 cm,
because any transformation of the form
x j new = x j old + position-dependent terms of O ( ϵ 2 R ) x j new  = x j old  +  position-dependent terms of  O ϵ 2 R x^(j)_("new ")=x^(j)_("old ")+" position-dependent terms of "O(epsilon^(2)R_(o.))x^{j}{ }_{\text {new }}=x^{j}{ }_{\text {old }}+\text { position-dependent terms of } O\left(\epsilon^{2} R_{\odot}\right)xjnew =xjold + position-dependent terms of O(ϵ2R)
would destroy the form ( 39.23 b ) of the space part of the metric.
In summary, for almost every metric theory of gravity yet invented, accurate through post-Newtonian order the metric coefficients have the form (39.23). One theory is distinguished from another by the values of its ten "post-Newtonian parameters" β , β 1 , β 2 , β 3 , β 4 , γ , ζ , η , Δ 1 β , β 1 , β 2 , β 3 , β 4 , γ , ζ , η , Δ 1 beta,beta_(1),beta_(2),beta_(3),beta_(4),gamma,zeta,eta,Delta_(1)\beta, \beta_{1}, \beta_{2}, \beta_{3}, \beta_{4}, \gamma, \zeta, \eta, \Delta_{1}β,β1,β2,β3,β4,γ,ζ,η,Δ1 and Δ 2 Δ 2 Delta_(2)\Delta_{2}Δ2. These are determined by comparing the field equations of the given theory with the form (39.23) of the post-Newtonian metric. The parameter values for general relativity and for several other theories are given in Box 39.2 , along with a heuristic description of each parameter.

EXERCISES

Exercise 39.8. ABSENCE OF "METRIC-GENERATES-METRIC" TERMS IN POST-NEWTONIAN LIMIT

In writing down the post-Newtonian metric corrections, one might be tempted to include terms that are generated by the Newtonian potential acting alone, without any direct aid from the matter. After all, general relativity and other metric theories are nonlinear; so the two-step process (matter) U U longrightarrow U longrightarrow\longrightarrow U \longrightarrowU (post-Newtonian metric corrections) seems quite natural. Show that such terms are not needed, because the equations (39.14)-(39.16) of the Newtonian approximation enable one to reexpress them in terms of direct integrals over the matter distribution. In particular, show that
(39.27) 2 U ( x , t ) / x j t | x x | d 3 x = 2 π [ V j ( x , t ) W j ( x , t ) ] (39.27) 2 U x , t / x j t x x d 3 x = 2 π V j ( x , t ) W j ( x , t ) {:(39.27)int(del^(2)U(x^('),t)//delx_(j)^(')del t)/(|x-x^(')|)d^(3)x^(')=2pi[V_(j)(x,t)-W_(j)(x,t)]:}\begin{equation*} \int \frac{\partial^{2} U\left(\boldsymbol{x}^{\prime}, t\right) / \partial x_{j}^{\prime} \partial t}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime}=2 \pi\left[V_{j}(\boldsymbol{x}, t)-W_{j}(\boldsymbol{x}, t)\right] \tag{39.27} \end{equation*}(39.27)2U(x,t)/xjt|xx|d3x=2π[Vj(x,t)Wj(x,t)]
where V j V j V_(j)V_{j}Vj and W j W j W_(j)W_{j}Wj are defined by equations ( 39.23 c , d 39.23 c , d 39.23c,d39.23 \mathrm{c}, \mathrm{d}39.23c,d ); also show that
[ U ( x , t ) / x j ] [ U ( x , t ) / x j ] | x x | d 3 x (39.28) = 2 π [ U ( x , t ) ] 2 + 4 π ρ o ( x , t ) U ( x , t ) | x x | d 3 x . U x , t / x j U x , t / x j x x d 3 x (39.28) = 2 π [ U ( x , t ) ] 2 + 4 π ρ o x , t U x , t x x d 3 x . {:[int([del U(x^('),t)//delx_(j)^(')][del U(x^('),t)//delx_(j)^(')])/(|x-x^(')|)d^(3)x^(')],[(39.28)=-2pi[U(x","t)]^(2)+4pi int(rho_(o)(x^('),t)U(x^('),t))/(|x-x^(')|)d^(3)x^(').]:}\begin{align*} \int \frac{\left[\partial U\left(\boldsymbol{x}^{\prime}, t\right) / \partial x_{j}^{\prime}\right]\left[\partial U\left(\boldsymbol{x}^{\prime}, t\right) / \partial x_{j}^{\prime}\right]}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime} & \\ & =-2 \pi[U(\boldsymbol{x}, t)]^{2}+4 \pi \int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right) U\left(\boldsymbol{x}^{\prime}, t\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime} . \tag{39.28} \end{align*}[U(x,t)/xj][U(x,t)/xj]|xx|d3x(39.28)=2π[U(x,t)]2+4πρo(x,t)U(x,t)|xx|d3x.
Note that the terms on the righthand sides of (39.27) and (39.28) are already included in the expressions ( 39.23 e , f 39.23 e , f 39.23e,f39.23 \mathrm{e}, \mathrm{f}39.23e,f ) for g 0 j g 0 j g_(0j)g_{0 j}g0j and g 00 g 00 g_(00)g_{00}g00.

Exercise 39.9. REMOVAL OF Σ Σ Sigma\boldsymbol{\Sigma}Σ TERM FROM g 00 g 00 g_(00)\boldsymbol{g}_{00}g00

Show that the coordinate transformation (39.25) removes the Σ Σ Sigma\SigmaΣ term from the metric coefficient g 00 g 00 g_(00)g_{00}g00 of equation (39.24), as claimed in the text.
Exercise 39.10. VERIFICATION OF FORMS OF POST-NEWTONIAN CORRECTIONS
Verify the claims in the text immediately preceding equations ( 39.23 a , b , c , f 39.23 a , b , c , f 39.23a,b,c,f39.23 \mathrm{a}, \mathrm{b}, \mathrm{c}, \mathrm{f}39.23a,b,c,f ).

§39.9. VELOCITY OF PPN COORDINATES RELATIVE TO "UNIVERSAL REST FRAME"

Thus far it has been assumed tacitly that the center of mass of the solar system is at rest in the PPN coordinate system. Is this really a permissible assumption? Put differently, can one always so adjust the PPN coordinate system that its origin moves with any desired velocity (e.g., that of the solar system); or is the PPN coordinate system rigidly and irrevocably attached to some "universal rest frame"?
In general relativity, the geometry of curved spacetime picks out no preferred coordinate frames (except in cases with special symmetry). Therefore, one expects the velocity of the PPN coordinate frame to be freely specifiable. Put differently, one expects the entire PPN formalism, for general relativity, to be invariant under Lorentz transformations of the PPN coordinates [combined, perhaps, with "infinitesimal coordinate transformations" to maintain the gauge conditions that the " Σ Σ Sigma\SigmaΣ " and " U j k U j k U_(jk)U_{j k}Ujk " terms of (39.24) and (39.23a) be absent]. By contrast, in Ni's theory of gravity (Box 39.1) the geometry of spacetime always picks out a preferred coordinate frame: the "rest frame of the universe." One would not be surprised, in this case, to find the PPN coordinate frame rigidly attached to the universal rest frame.
The above intuition is correct, according to calculations by Will (1971d) and by Will and Nordtvedt (1972). When dealing with general relativity and other theories with little or no "prior geometry," one can freely specify the velocity of the PPN coordinate system (at some initial instant of time). But for theories like Ni's, with a preferred "universal rest frame" ("preferred-frame theories"), only in the preferred frame can the post-Newtonian metric assume the form derived in the last section [equations (39.23)]. This restriction on the PPN metric does not mean that one is confined, in preferred-frame theories, to perform all calculations in the universal rest frame. Rather, it means that for such theories the PPN metric requires generalization to take account of coordinate-frame motion relative to the universal rest frame.
The required generalization can be achieved by subjecting the PPN metric (39.23) to (1) a Lorentz boost from the preferred frame { x OLD α } x OLD  α {x_("OLD ")^(alpha)}\left\{x_{\text {OLD }}^{\alpha}\right\}{xOLD α} to a new PPN frame { x NEW α } x NEW  α {x_("NEW ")^(alpha)}\left\{x_{\text {NEW }}^{\alpha}\right\}{xNEW α}, which moves with velocity w w w\boldsymbol{w}w, plus (2) a change of gauge designed to keep the metric coefficients as simple as possible. The boost-plus-gauge-change is [Will and Nordtvedt (1972)]
Preferred-frame theories of gravity
Generalization of PPN metric to moving frames
(39.29a) x OLD = x NEW + 1 2 ( x NEW w ) w + ( 1 + 1 2 w 2 ) w t NEW + O ( ϵ 5 t NEW + ϵ 4 x NEW ) t OLD = t NEW ( 1 + 1 2 w 2 + 3 8 w 4 ) + ( 1 + 1 2 w 2 ) x NEW w (39.29b) + ( 1 2 Δ 2 + ζ 1 ) w j χ x NEW j + O ( ϵ 6 t NEW + ϵ 5 x NEW ) (39.29c) χ ( t NEW , x NEW ) ρ o ( t NEW , x NEW ) | x NEW x NEW | d 3 x NEW (39.29a) x OLD = x NEW + 1 2 x NEW w w + 1 + 1 2 w 2 w t NEW + O ϵ 5 t NEW + ϵ 4 x NEW t OLD = t NEW 1 + 1 2 w 2 + 3 8 w 4 + 1 + 1 2 w 2 x NEW w (39.29b) + 1 2 Δ 2 + ζ 1 w j χ x NEW j + O ϵ 6 t NEW + ϵ 5 x NEW (39.29c) χ t NEW , x NEW ρ o t NEW , x NEW x NEW x NEW d 3 x NEW {:[(39.29a)x_(OLD)=x_(NEW)+(1)/(2)(x_(NEW)*w)w+(1+(1)/(2)w^(2))wt_(NEW)],[+O(epsilon^(5)t_(NEW)+epsilon^(4)x_(NEW))],[t_(OLD)=t_(NEW)(1+(1)/(2)w^(2)+(3)/(8)w^(4))+(1+(1)/(2)w^(2))x_(NEW)*w],[(39.29b)+ubrace(((1)/(2)Delta_(2)+zeta-1)w_(j)(del chi)/(delx_(NEW)^(j))ubrace)+O(epsilon^(6)t_(NEW)+epsilon^(5)x_(NEW))],[(39.29c)chi(t_(NEW),x_(NEW))-=-intrho_(o)(t_(NEW),x_(NEW)^('))|x_(NEW)-x_(NEW)^(')|d^(3)x_(NEW)^(')]:}\begin{align*} x_{\mathrm{OLD}}= & x_{\mathrm{NEW}}+\frac{1}{2}\left(x_{\mathrm{NEW}} \cdot w\right) w+\left(1+\frac{1}{2} w^{2}\right) w t_{\mathrm{NEW}} \tag{39.29a}\\ & +O\left(\epsilon^{5} t_{\mathrm{NEW}}+\epsilon^{4} x_{\mathrm{NEW}}\right) \\ t_{\mathrm{OLD}}= & t_{\mathrm{NEW}}\left(1+\frac{1}{2} w^{2}+\frac{3}{8} w^{4}\right)+\left(1+\frac{1}{2} w^{2}\right) x_{\mathrm{NEW}} \cdot w \\ & +\underbrace{\left(\frac{1}{2} \Delta_{2}+\zeta-1\right) w_{j} \frac{\partial \chi}{\partial x_{\mathrm{NEW}}^{j}}}+O\left(\epsilon^{6} t_{\mathrm{NEW}}+\epsilon^{5} x_{\mathrm{NEW}}\right) \tag{39.29b}\\ \chi\left(t_{\mathrm{NEW}}, x_{\mathrm{NEW}}\right) \equiv & -\int \rho_{o}\left(t_{\mathrm{NEW}}, x_{\mathrm{NEW}}^{\prime}\right)\left|x_{\mathrm{NEW}}-x_{\mathrm{NEW}}^{\prime}\right| d^{3} x_{\mathrm{NEW}}^{\prime} \tag{39.29c} \end{align*}(39.29a)xOLD=xNEW+12(xNEWw)w+(1+12w2)wtNEW+O(ϵ5tNEW+ϵ4xNEW)tOLD=tNEW(1+12w2+38w4)+(1+12w2)xNEWw(39.29b)+(12Δ2+ζ1)wjχxNEWj+O(ϵ6tNEW+ϵ5xNEW)(39.29c)χ(tNEW,xNEW)ρo(tNEW,xNEW)|xNEWxNEW|d3xNEW
[Note: One insists, in the spirit of the post-Newtonian approximation, that the velocity w w w\boldsymbol{w}w of the new PPN frame relative to the universal rest frame be no larger than the characteristic internal velocities of the system:
(39.30) | w | ϵ . ] (39.30) | w | ϵ . ] {:(39.30)|w|≲epsilon.]:}\begin{equation*} |\boldsymbol{w}| \lesssim \epsilon .] \tag{39.30} \end{equation*}(39.30)|w|ϵ.]
This change of coordinates produces corresponding changes in the velocity of the matter
v OLD = d x OLD d t OLD = v NEW ( 1 w v NEW 1 2 w 2 ) (39.31) + w ( 1 1 2 w v NEW ) + O ( ϵ 5 ) v OLD = d x OLD d t OLD = v NEW 1 w v NEW 1 2 w 2 (39.31) + w 1 1 2 w v NEW + O ϵ 5 {:[v_(OLD)=(dx_(OLD))/(dt_(OLD))=v_(NEW)(1-w*v_(NEW)-(1)/(2)w^(2))],[(39.31)+w(1-(1)/(2)w*v_(NEW))+O(epsilon^(5))]:}\begin{align*} v_{\mathrm{OLD}}=\frac{d x_{\mathrm{OLD}}}{d t_{\mathrm{OLD}}}= & v_{\mathrm{NEW}}\left(1-w \cdot v_{\mathrm{NEW}}-\frac{1}{2} w^{2}\right) \\ & +w\left(1-\frac{1}{2} w \cdot v_{\mathrm{NEW}}\right)+O\left(\epsilon^{5}\right) \tag{39.31} \end{align*}vOLD=dxOLDdtOLD=vNEW(1wvNEW12w2)(39.31)+w(112wvNEW)+O(ϵ5)
A long but straightforward calculation (exercise 39.11) yields the following components for the metric in the new PPN coordinates. [Note: The subscripts NEW are here and hereafter dropped from the notation.]
(39.32a) g j k = δ j k ( 1 + 2 γ U ) + O ( ϵ 4 ) (39.32b) g 0 j = 7 2 Δ 1 V j 1 2 Δ 2 W j + ( α 2 1 2 α 1 ) w j U α 2 w k U k j + O ( ϵ 5 ) , g 00 = 1 + 2 U 2 β U 2 + 4 Ψ ζ Q η D (39.32c) + ( α 2 + α 3 α 1 ) w 2 U + ( 2 α 3 α 1 ) w j V j α 2 w j w k U j k + O ( ϵ 6 ) . (39.32a) g j k = δ j k ( 1 + 2 γ U ) + O ϵ 4 (39.32b) g 0 j = 7 2 Δ 1 V j 1 2 Δ 2 W j + α 2 1 2 α 1 w j U α 2 w k U k j + O ϵ 5 , g 00 = 1 + 2 U 2 β U 2 + 4 Ψ ζ Q η D (39.32c) + α 2 + α 3 α 1 w 2 U + 2 α 3 α 1 w j V j α 2 w j w k U j k + O ϵ 6 . {:[(39.32a)g_(jk)=delta_(jk)(1+2gamma U)+O(epsilon^(4))],[(39.32b)g_(0j)=-(7)/(2)Delta_(1)V_(j)-(1)/(2)Delta_(2)W_(j)+(alpha_(2)-(1)/(2)alpha_(1))w_(j)U-alpha_(2)w_(k)U_(kj)+O(epsilon^(5))","],[g_(00)=-1+2U-2betaU^(2)+4Psi-zeta Q-etaD],[(39.32c)+(alpha_(2)+alpha_(3)-alpha_(1))w^(2)U+(2alpha_(3)-alpha_(1))w_(j)V_(j)-alpha_(2)w_(j)w_(k)U_(jk)+O(epsilon^(6)).]:}\begin{gather*} g_{j k}=\delta_{j k}(1+2 \gamma U)+O\left(\epsilon^{4}\right) \tag{39.32a}\\ g_{0 j}=-\frac{7}{2} \Delta_{1} V_{j}-\frac{1}{2} \Delta_{2} W_{j}+\left(\alpha_{2}-\frac{1}{2} \alpha_{1}\right) w_{j} U-\alpha_{2} w_{k} U_{k j}+O\left(\epsilon^{5}\right), \tag{39.32b}\\ g_{00}=-1+2 U-2 \beta U^{2}+4 \Psi-\zeta Q-\eta \mathscr{D} \\ +\left(\alpha_{2}+\alpha_{3}-\alpha_{1}\right) w^{2} U+\left(2 \alpha_{3}-\alpha_{1}\right) w_{j} V_{j}-\alpha_{2} w_{j} w_{k} U_{j k}+O\left(\epsilon^{6}\right) . \tag{39.32c} \end{gather*}(39.32a)gjk=δjk(1+2γU)+O(ϵ4)(39.32b)g0j=72Δ1Vj12Δ2Wj+(α212α1)wjUα2wkUkj+O(ϵ5),g00=1+2U2βU2+4ΨζQηD(39.32c)+(α2+α3α1)w2U+(2α3α1)wjVjα2wjwkUjk+O(ϵ6).
Here α 1 , α 2 α 1 , α 2 alpha_(1),alpha_(2)\alpha_{1}, \alpha_{2}α1,α2, and α 3 α 3 alpha_(3)\alpha_{3}α3 are certain combinations of PPN parameters
(39.33a) α 1 = 7 Δ 1 + Δ 2 4 γ 4 (39.33b) α 2 = Δ 2 + ζ 1 (39.33c) α 3 = 4 β 1 2 γ 2 ζ (39.33a) α 1 = 7 Δ 1 + Δ 2 4 γ 4 (39.33b) α 2 = Δ 2 + ζ 1 (39.33c) α 3 = 4 β 1 2 γ 2 ζ {:[(39.33a)alpha_(1)=7Delta_(1)+Delta_(2)-4gamma-4],[(39.33b)alpha_(2)=Delta_(2)+zeta-1],[(39.33c)alpha_(3)=4beta_(1)-2gamma-2-zeta]:}\begin{gather*} \alpha_{1}=7 \Delta_{1}+\Delta_{2}-4 \gamma-4 \tag{39.33a}\\ \alpha_{2}=\Delta_{2}+\zeta-1 \tag{39.33b}\\ \alpha_{3}=4 \beta_{1}-2 \gamma-2-\zeta \tag{39.33c} \end{gather*}(39.33a)α1=7Δ1+Δ24γ4(39.33b)α2=Δ2+ζ1(39.33c)α3=4β12γ2ζ
The "gravitational potentials" U , V j , W j , Ψ , a U , V j , W j , Ψ , a U,V_(j),W_(j),Psi,aU, V_{j}, W_{j}, \Psi, \mathfrak{a}U,Vj,Wj,Ψ,a, and D D D\mathscr{D}D appearing here are to be calculated in the new, "moving" PPN coordinate system by the same prescriptions
as one used in the universal rest frame. Thus, their functional forms are the same as previously, but their values at any given event are different (see exercise 39.11):
(39.34a) U ( x , t ) = ρ o ( x , t ) | x x | d 3 x ; (39.34b) V j ( x , t ) = ρ o ( x , t ) v j ( x , t ) | x x | d 3 x ; (39.34c) W j ( x , t ) = ρ o ( x , t ) [ ( x x ) v ( x , t ) ] ( x j x j ) d 3 x | x x | 3 ; Ψ ( x , t ) = ρ o ( x , t ) ψ ( x , t ) | x x | d 3 x , (39.34d) ψ = β 1 v 2 + β 2 U + 1 2 β 3 Π + 3 2 β 4 p / ρ o ; (39.34e) a ( x , t ) = ρ o ( x , t ) [ ( x x ) v ( x , t ) ] 2 | x x | 3 d 3 x ; (39.34f) D ( x , t ) = [ x x | 3 [ t j k ^ ( x , t ) 1 3 δ j k t l l ( x , t ) ] ( x j x j ) ( x k x k ) d 3 x . (39.34a) U ( x , t ) = ρ o x , t x x d 3 x ; (39.34b) V j ( x , t ) = ρ o x , t v j x , t x x d 3 x ; (39.34c) W j ( x , t ) = ρ o x , t x x v x , t x j x j d 3 x x x 3 ; Ψ ( x , t ) = ρ o x , t ψ x , t x x d 3 x , (39.34d) ψ = β 1 v 2 + β 2 U + 1 2 β 3 Π + 3 2 β 4 p / ρ o ; (39.34e) a ( x , t ) = ρ o x , t x x v x , t 2 x x 3 d 3 x ; (39.34f) D ( x , t ) = x x 3 t j k ^ x , t 1 3 δ j k t l l x , t x j x j x k x k d 3 x . {:[(39.34a)U(x","t)=int(rho_(o)(x^('),t))/(|x-x^(')|)d^(3)x^(');],[(39.34b)V_(j)(x","t)=int(rho_(o)(x^('),t)v_(j)(x^('),t))/(|x-x^(')|)d^(3)x^(');],[(39.34c)W_(j)(x","t)=int(rho_(o)(x^('),t)[(x-x^('))*v(x^('),t)](x_(j)-x_(j)^('))d^(3)x^('))/(|x-x^(')|^(3));],[Psi(x","t)=int(rho_(o)(x^('),t)psi(x^('),t))/(|x-x^(')|)d^(3)x^(')","],[(39.34d)psi=beta_(1)v^(2)+beta_(2)U+(1)/(2)beta_(3)Pi+(3)/(2)beta_(4)p//rho_(o);],[(39.34e)a(x","t)=int(rho_(o)(x^('),t)[(x-x^('))*v(x^('),t)]^(2))/(|x-x^(')|^(3))d^(3)x^(');],[(39.34f)D(x","t)=int([x-x^(')|^(3))/([t_(j hat(k))(x^('),t)-(1)/(3)delta_(jk)t_(ll)(x^('),t)](x_(j)-x_(j)^('))(x_(k)-x_(k)^(')))d^(3)x^(').]:}\begin{gather*} U(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime} ; \tag{39.34a}\\ V_{j}(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right) v_{j}\left(\boldsymbol{x}^{\prime}, t\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime} ; \tag{39.34b}\\ W_{j}(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)\left[\left(\boldsymbol{x}-\boldsymbol{x}^{\prime}\right) \cdot \boldsymbol{v}\left(\boldsymbol{x}^{\prime}, t\right)\right]\left(x_{j}-x_{j}^{\prime}\right) d^{3} x^{\prime}}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|^{3}} ; \tag{39.34c}\\ \Psi(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right) \psi\left(\boldsymbol{x}^{\prime}, t\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|} d^{3} x^{\prime}, \\ \psi=\beta_{1} \boldsymbol{v}^{2}+\beta_{2} U+\frac{1}{2} \beta_{3} \Pi+\frac{3}{2} \beta_{4} p / \rho_{o} ; \tag{39.34d}\\ a(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)\left[\left(\boldsymbol{x}-\boldsymbol{x}^{\prime}\right) \cdot \boldsymbol{v}\left(\boldsymbol{x}^{\prime}, t\right)\right]^{2}}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|^{3}} d^{3} x^{\prime} ; \tag{39.34e}\\ \mathscr{D}(\boldsymbol{x}, t)=\int \frac{\left[\boldsymbol{x}-\left.\boldsymbol{x}^{\prime}\right|^{3}\right.}{\left[t_{j \hat{k}}\left(\boldsymbol{x}^{\prime}, t\right)-\frac{1}{3} \delta_{j k} t_{l l}\left(\boldsymbol{x}^{\prime}, t\right)\right]\left(x_{j}-x_{j}^{\prime}\right)\left(x_{k}-x_{k}^{\prime}\right)} d^{3} x^{\prime} . \tag{39.34f} \end{gather*}(39.34a)U(x,t)=ρo(x,t)|xx|d3x;(39.34b)Vj(x,t)=ρo(x,t)vj(x,t)|xx|d3x;(39.34c)Wj(x,t)=ρo(x,t)[(xx)v(x,t)](xjxj)d3x|xx|3;Ψ(x,t)=ρo(x,t)ψ(x,t)|xx|d3x,(39.34d)ψ=β1v2+β2U+12β3Π+32β4p/ρo;(39.34e)a(x,t)=ρo(x,t)[(xx)v(x,t)]2|xx|3d3x;(39.34f)D(x,t)=[xx|3[tjk^(x,t)13δjktll(x,t)](xjxj)(xkxk)d3x.
The quantity U j k U j k U_(jk)U_{j k}Ujk is the gravitational potential defined in equation (39.23a):
(39.34~g) U j k ( x , t ) = ρ o ( x , t ) ( x j x j ) ( x k x k ) | x x | 3 d 3 x (39.34~g) U j k ( x , t ) = ρ o x , t x j x j x k x k x x 3 d 3 x {:(39.34~g)U_(jk)(x","t)=int(rho_(o)(x^('),t)(x_(j)-x_(j)^('))(x_(k)-x_(k)^(')))/(|x-x^(')|^(3))d^(3)x^('):}\begin{equation*} U_{j k}(\boldsymbol{x}, t)=\int \frac{\rho_{o}\left(\boldsymbol{x}^{\prime}, t\right)\left(x_{j}-x_{j}^{\prime}\right)\left(x_{k}-x_{k}^{\prime}\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|^{3}} d^{3} x^{\prime} \tag{39.34~g} \end{equation*}(39.34~g)Ujk(x,t)=ρo(x,t)(xjxj)(xkxk)|xx|3d3x
Notice that the velocity w w w\boldsymbol{w}w of the PPN coordinate system relative to the universal rest frame appears explicitly in the PPN metric only if one or more of the coefficients α 1 , α 2 , α 3 α 1 , α 2 , α 3 alpha_(1),alpha_(2),alpha_(3)\alpha_{1}, \alpha_{2}, \alpha_{3}α1,α2,α3, is nonzero. Thus, theories with α 1 = α 2 = α 3 = 0 α 1 = α 2 = α 3 = 0 alpha_(1)=alpha_(2)=alpha_(3)=0\alpha_{1}=\alpha_{2}=\alpha_{3}=0α1=α2=α3=0 (e.g., general relativity) possess no preferred universal rest frame in the post-Newtonian limit; all their PPN frames are "created equal." By contrast, theories with at least one of α 1 , α 2 , α 3 α 1 , α 2 , α 3 alpha_(1),alpha_(2),alpha_(3)\alpha_{1}, \alpha_{2}, \alpha_{3}α1,α2,α3, nonzero (e.g., Ni's theory) do possess a preferred frame.
The generalized form (39.32) of the PPN metric, by virtue of the process used to construct it, is invariant under a Lorentz boost plus a gauge adjustment ["PostGalilean transformation"; see Chandrasekhar and Contopolous (1967)]:
x OLD = x NEW + 1 2 ( x NEW β ) β + ( 1 + 1 2 β 2 ) β t NEW (39.35) + O ( ϵ 5 t NEW + ϵ 4 x NEW ) t OLD = ( 1 + + 1 2 β 2 + 3 8 β 4 ) t NEW + ( 1 + 1 2 β 2 ) x NEW β + ( 1 2 Δ 2 + ζ 1 1 ) β NEW χ + O ( ϵ 6 t NEW + ϵ 5 x NEW ) x OLD = x NEW + 1 2 x NEW β β + 1 + 1 2 β 2 β t NEW (39.35) + O ϵ 5 t NEW + ϵ 4 x NEW t OLD = ( 1 + + 1 2 β 2 + 3 8 β 4 t NEW + 1 + 1 2 β 2 x NEW β + 1 2 Δ 2 + ζ 1 1 β NEW χ + O ϵ 6 t NEW + ϵ 5 x NEW {:[x_(OLD)=x_(NEW)+(1)/(2)(x_(NEW)*beta)beta+(1+(1)/(2)beta^(2))betat_(NEW)],[(39.35)+O(epsilon^(5)t_(NEW)+epsilon^(4)x_(NEW))],[t_(OLD)=(1+],[{:+(1)/(2)beta^(2)+(3)/(8)beta^(4))t_(NEW)+(1+(1)/(2)beta^(2))x_(NEW)*beta],[+((1)/(2)Delta_(2)+zeta_(1)-1)beta*grad_(NEW)chi+O(epsilon^(6)t_(NEW)+epsilon^(5)x_(NEW))]:}\begin{align*} & \boldsymbol{x}_{\mathrm{OLD}}= \boldsymbol{x}_{\mathrm{NEW}}+\frac{1}{2}\left(\boldsymbol{x}_{\mathrm{NEW}} \cdot \boldsymbol{\beta}\right) \boldsymbol{\beta}+\left(1+\frac{1}{2} \beta^{2}\right) \boldsymbol{\beta} t_{\mathrm{NEW}} \\ &+O\left(\epsilon^{5} t_{\mathrm{NEW}}+\epsilon^{4} x_{\mathrm{NEW}}\right) \tag{39.35}\\ & t_{\mathrm{OLD}}=(1+ \\ &\left.+\frac{1}{2} \beta^{2}+\frac{3}{8} \beta^{4}\right) t_{\mathrm{NEW}}+\left(1+\frac{1}{2} \beta^{2}\right) x_{\mathrm{NEW}} \cdot \boldsymbol{\beta} \\ &+\left(\frac{1}{2} \Delta_{2}+\zeta_{1}-1\right) \boldsymbol{\beta} \cdot \nabla_{\mathrm{NEW}} \chi+O\left(\epsilon^{6} t_{\mathrm{NEW}}+\epsilon^{5} x_{\mathrm{NEW}}\right) \end{align*}xOLD=xNEW+12(xNEWβ)β+(1+12β2)βtNEW(39.35)+O(ϵ5tNEW+ϵ4xNEW)tOLD=(1++12β2+38β4)tNEW+(1+12β2)xNEWβ+(12Δ2+ζ11)βNEWχ+O(ϵ6tNEW+ϵ5xNEW)
Of course, it is also invariant under spatial rotations.

EXERCISE

Exercise 39.11. TRANSFORMATION TO MOVING FRAME

Show that the change of coordinates (39.29) changes the PPN metric coefficients from the form (39.23) to the form (39.32). [Hints: (1) Keep firmly in mind the fact that the potentials U , V j , W j , A U , V j , W j , A U,V_(j),W_(j),AU, V_{j}, W_{j}, \mathscr{A}U,Vj,Wj,A, and D D D\mathscr{D}D are not scalar fields. Each coordinate system possesses its own potentials. For example, by using equations (39.29) in the integral for U oLD U oLD  U_("oLD ")U_{\text {oLD }}UoLD , one finds
(39.36) U OLD ( x OLD , t OLD ) = ρ o ( x OLD , t OLD ) | x OLD x OLD | d 3 x OLD = [ U NEW w j ( V j NEW W j NEW ) + 1 2 w j w k x , j k ] x NEW , t NBW + O ( ϵ 6 ) . (39.36) U OLD x OLD , t OLD = ρ o x OLD , t OLD x OLD x OLD d 3 x OLD = U NEW w j V j NEW W j NEW + 1 2 w j w k x , j k x NEW , t NBW + O ϵ 6 . {:[(39.36)U_(OLD)(x_(OLD),t_(OLD))=int(rho_(o)(x_(OLD)^('),t_(OLD)))/(|x_(OLD)-x_(OLD)^(')|)d^(3)x_(OLD)^(')],[=[U_(NEW)-w_(j)(V_(jNEW)-W_(jNEW))+(1)/(2)w_(j)w_(k)x_(,jk)]_(x_(NEW),t_(NBW))+O(epsilon^(6)).]:}\begin{align*} U_{\mathrm{OLD}}\left(x_{\mathrm{OLD}}, t_{\mathrm{OLD}}\right) & =\int \frac{\rho_{o}\left(x_{\mathrm{OLD}}^{\prime}, t_{\mathrm{OLD}}\right)}{\left|x_{\mathrm{OLD}}-x_{\mathrm{OLD}}^{\prime}\right|} d^{3} x_{\mathrm{OLD}}^{\prime} \tag{39.36}\\ & =\left[U_{\mathrm{NEW}}-w_{j}\left(V_{j \mathrm{NEW}}-W_{j \mathrm{NEW}}\right)+\frac{1}{2} w_{j} w_{k} x_{, j k}\right]_{x_{\mathrm{NEW}}, t_{\mathrm{NBW}}}+O\left(\epsilon^{6}\right) . \end{align*}(39.36)UOLD(xOLD,tOLD)=ρo(xOLD,tOLD)|xOLDxOLD|d3xOLD=[UNEWwj(VjNEWWjNEW)+12wjwkx,jk]xNEW,tNBW+O(ϵ6).
(2) The law of baryon conservation (39.44) may be useful.]

§39.10. PPN STRESS-ENERGY TENSOR

The motion of the solar system is governed by the equations T α β ; β = 0 T α β ; β = 0 T^(alpha beta);beta=0T^{\alpha \beta} ; \beta=0Tαβ;β=0. Before studying them, one must calculate the post-Newtonian corrections to the stress-energy tensor in the PPN coordinate frame. This requires a transformation from the comoving, orthonormal frame ω α ^ ω α ^ omega^( hat(alpha))\boldsymbol{\omega}^{\hat{\alpha}}ωα^, where
(39.37) T 0 ^ 0 ^ = ρ o ( 1 + Π ) , T 0 ^ j ^ = 0 , T j ^ k ^ = t j k ^ (39.37) T 0 ^ 0 ^ = ρ o ( 1 + Π ) , T 0 ^ j ^ = 0 , T j ^ k ^ = t j k ^ {:(39.37)T^( hat(0) hat(0))=rho_(o)(1+Pi)","quadT^( hat(0) hat(j))=0","quadT^( hat(j) hat(k))=t_(j hat(k)):}\begin{equation*} T^{\hat{0} \hat{0}}=\rho_{o}(1+\Pi), \quad T^{\hat{0} \hat{j}}=0, \quad T^{\hat{j} \hat{k}}=t_{j \hat{k}} \tag{39.37} \end{equation*}(39.37)T0^0^=ρo(1+Π),T0^j^=0,Tj^k^=tjk^
to the coordinate frame. One can effect this transformation in two stages: stage 2 is a transformation
(39.38a) ω 0 ~ [ 1 U + O ( ϵ 4 ) ] d t + [ 7 2 Δ 1 V j + 1 2 Δ 2 W j + ( 1 2 α 1 α 2 ) w j U + α 2 w k U k j + O ( ϵ 5 ) ] d x j (39.38b) ω j ~ [ ( 1 + γ U ) δ j k + O ( ϵ 4 ) ] d x k + O ( ϵ 5 ) d t (39.38a) ω 0 ~ 1 U + O ϵ 4 d t + 7 2 Δ 1 V j + 1 2 Δ 2 W j + 1 2 α 1 α 2 w j U + α 2 w k U k j + O ϵ 5 d x j (39.38b) ω j ~ ( 1 + γ U ) δ j k + O ϵ 4 d x k + O ϵ 5 d t {:[(39.38a)omega^( tilde(0))-=[1-U+O(epsilon^(4))]dt],[+[(7)/(2)Delta_(1)V_(j)+(1)/(2)Delta_(2)W_(j)+((1)/(2)alpha_(1)-alpha_(2))w_(j)U+alpha_(2)w_(k)U_(kj)+O(epsilon^(5))]dx^(j)],[(39.38b)omega^( tilde(j))-=[(1+gamma U)delta_(jk)+O(epsilon^(4))]dx^(k)+O(epsilon^(5))dt]:}\begin{gather*} \boldsymbol{\omega}^{\tilde{0}} \equiv\left[1-U+O\left(\epsilon^{4}\right)\right] \boldsymbol{d} t \tag{39.38a}\\ +\left[\frac{7}{2} \Delta_{1} V_{j}+\frac{1}{2} \Delta_{2} W_{j}+\left(\frac{1}{2} \alpha_{1}-\alpha_{2}\right) w_{j} U+\alpha_{2} w_{k} U_{k j}+O\left(\epsilon^{5}\right)\right] \boldsymbol{d} x^{j} \\ \boldsymbol{\omega}^{\tilde{j}} \equiv\left[(1+\gamma U) \delta_{j k}+O\left(\epsilon^{4}\right)\right] \boldsymbol{d} x^{k}+O\left(\epsilon^{5}\right) \boldsymbol{d} t \tag{39.38b} \end{gather*}(39.38a)ω0~[1U+O(ϵ4)]dt+[72Δ1Vj+12Δ2Wj+(12α1α2)wjU+α2wkUkj+O(ϵ5)]dxj(39.38b)ωj~[(1+γU)δjk+O(ϵ4)]dxk+O(ϵ5)dt
between the coordinate frame and an orthonormal frame attached to it; stage 1 is a pure Lorentz transformation (boost) between the two orthonormal frames ω α ~ ω α ~ omega^( tilde(alpha))\boldsymbol{\omega}^{\tilde{\alpha}}ωα~ and ω α ^ ω α ^ omega^( hat(alpha))\boldsymbol{\omega}^{\hat{\alpha}}ωα^. The 4 -velocity of this boost is minus the 4 -velocity of the matter, which has components
(39.40) u j = v j u 0 , u 0 = 1 + 1 2 v 2 + U + O ( ϵ 4 ) in coord. frame; u j ~ = v j ~ u o ~ , u 0 ~ = 1 + 1 2 v 2 + O ( ϵ 4 ) , v j ~ = v j [ 1 + ( 1 + γ ) U ] } in ω α ~ frame. (39.40) u j = v j u 0 , u 0 = 1 + 1 2 v 2 + U + O ϵ 4  in coord. frame;  u j ~ = v j ~ u o ~ , u 0 ~ = 1 + 1 2 v 2 + O ϵ 4 , v j ~ = v j [ 1 + ( 1 + γ ) U ]  in  ω α ~  frame.  {:(39.40){:[u^(j)=v_(j)u^(0)","quadu^(0)=1+(1)/(2)v^(2)+U+O(epsilon^(4))quad" in coord. frame; "],[u^( tilde(j))=v_( tilde(j))u^( tilde(o))","quadu^( tilde(0))=1+(1)/(2)v^(2)+O(epsilon^(4))","],[v_( tilde(j))=v_(j)[1+(1+gamma)U]]}" in "omega^( tilde(alpha))" frame. ":}\left.\begin{array}{c} u^{j}=v_{j} u^{0}, \quad u^{0}=1+\frac{1}{2} v^{2}+U+O\left(\epsilon^{4}\right) \quad \text { in coord. frame; } \\ u^{\tilde{j}}=v_{\tilde{j}} u^{\tilde{o}}, \quad u^{\tilde{0}}=1+\frac{1}{2} v^{2}+O\left(\epsilon^{4}\right), \tag{39.40}\\ v_{\tilde{j}}=v_{j}[1+(1+\gamma) U] \end{array}\right\} \text { in } \boldsymbol{\omega}^{\tilde{\alpha}} \text { frame. }(39.40)uj=vju0,u0=1+12v2+U+O(ϵ4) in coord. frame; uj~=vj~uo~,u0~=1+12v2+O(ϵ4),vj~=vj[1+(1+γ)U]} in ωα~ frame. 
Combining the boost, which has ordinary velocity β j ~ = v j ~ β j ~ = v j ~ beta_( tilde(j))=-v_( tilde(j))\beta_{\tilde{j}}=-v_{\tilde{j}}βj~=vj~, with the transformation (39.38), and then inverting, one obtains the result (exercise 39.12)
d x α = A α β ω β ^ , { ω β ^ = orthonormal comoving basis, d x α = PPN coordinate basis; A 0 o ^ = 1 + 1 2 v 2 + U + O ( ϵ 4 ) , A 0 j = v j [ 1 + 1 2 v 2 + ( 2 + γ ) U ] 7 2 Δ 1 V j 1 2 Δ 2 W j (39.41) + ( α 2 1 2 α 1 ) w j U α 2 w k U k j + O ( ϵ 5 ) , A j 0 ^ = v j [ 1 + 1 2 v 2 + U ] + O ( ϵ 5 ) , A j k ^ = ( 1 γ U ) δ j k + 1 2 v j v k + O ( ϵ 4 ) . d x α = A α β ω β ^ , ω β ^ =  orthonormal comoving basis,  d x α =  PPN coordinate basis;  A 0 o ^ = 1 + 1 2 v 2 + U + O ϵ 4 , A 0 j = v j 1 + 1 2 v 2 + ( 2 + γ ) U 7 2 Δ 1 V j 1 2 Δ 2 W j (39.41) + α 2 1 2 α 1 w j U α 2 w k U k j + O ϵ 5 , A j 0 ^ = v j 1 + 1 2 v 2 + U + O ϵ 5 , A j k ^ = ( 1 γ U ) δ j k + 1 2 v j v k + O ϵ 4 . {:[dx^(alpha)=A^(alpha)_(beta)omega^( hat(beta))","quad{[omega^( hat(beta))=" orthonormal comoving basis, "],[dx^(alpha)=" PPN coordinate basis; "]:}],[A^(0)_( hat(o))=1+(1)/(2)v^(2)+U+O(epsilon^(4))","],[A^(0)_(j)=v_(j)[1+(1)/(2)v^(2)+(2+gamma)U]-(7)/(2)Delta_(1)V_(j)-(1)/(2)Delta_(2)W_(j)],[(39.41)+(alpha_(2)-(1)/(2)alpha_(1))w_(j)U-alpha_(2)w_(k)U_(kj)+O(epsilon^(5))","],[A^(j)_( hat(0))=v_(j)[1+(1)/(2)v^(2)+U]+O(epsilon^(5))","],[A^(j)_( hat(k))=(1-gamma U)delta_(jk)+(1)/(2)v_(j)v_(k)+O(epsilon^(4)).]:}\begin{align*} & \boldsymbol{d} x^{\alpha}= A^{\alpha}{ }_{\beta} \boldsymbol{\omega}^{\hat{\beta}}, \quad\left\{\begin{array}{l} \boldsymbol{\omega}^{\hat{\beta}}=\text { orthonormal comoving basis, } \\ \boldsymbol{d} x^{\alpha}=\text { PPN coordinate basis; } \end{array}\right. \\ & A^{0}{ }_{\hat{o}}=1+\frac{1}{2} v^{2}+U+O\left(\epsilon^{4}\right), \\ & A^{0}{ }_{j}= v_{j}\left[1+\frac{1}{2} v^{2}+(2+\gamma) U\right]-\frac{7}{2} \Delta_{1} V_{j}-\frac{1}{2} \Delta_{2} W_{j} \\ &+\left(\alpha_{2}-\frac{1}{2} \alpha_{1}\right) w_{j} U-\alpha_{2} w_{k} U_{k j}+O\left(\epsilon^{5}\right), \tag{39.41}\\ & A^{j}{ }_{\hat{0}}= v_{j}\left[1+\frac{1}{2} v^{2}+U\right]+O\left(\epsilon^{5}\right), \\ & A^{j}{ }_{\hat{k}}=(1-\gamma U) \delta_{j k}+\frac{1}{2} v_{j} v_{k}+O\left(\epsilon^{4}\right) . \end{align*}dxα=Aαβωβ^,{ωβ^= orthonormal comoving basis, dxα= PPN coordinate basis; A0o^=1+12v2+U+O(ϵ4),A0j=vj[1+12v2+(2+γ)U]72Δ1Vj12Δ2Wj(39.41)+(α212α1)wjUα2wkUkj+O(ϵ5),Aj0^=vj[1+12v2+U]+O(ϵ5),Ajk^=(1γU)δjk+12vjvk+O(ϵ4).
Transformation from rest frame of matter to PPN coordinate frame
This transformation, when applied to the stress-energy tensor (39.37) yields, in the PPN coordinate frame,
(39.42a) T 00 = ρ o ( 1 + Π + v 2 + 2 U ) + O ( ρ o ϵ 4 ) (39.42b) T 0 j = ρ o ( 1 + Π + v 2 + 2 U ) v j + t j ^ m ^ v m + O ( ρ o ϵ 5 ) T j k = t j k ^ ( 1 2 γ U ) + ρ o ( 1 + Π + v 2 + 2 U ) v j v k (39.42c) + 1 2 ( v j t k ^ m ^ v m + v k t j ^ m ^ v m ) + O ( ρ o ϵ 6 ) (39.42a) T 00 = ρ o 1 + Π + v 2 + 2 U + O ρ o ϵ 4 (39.42b) T 0 j = ρ o 1 + Π + v 2 + 2 U v j + t j ^ m ^ v m + O ρ o ϵ 5 T j k = t j k ^ ( 1 2 γ U ) + ρ o 1 + Π + v 2 + 2 U v j v k (39.42c) + 1 2 v j t k ^ m ^ v m + v k t j ^ m ^ v m + O ρ o ϵ 6 {:[(39.42a)T^(00)=rho_(o)(1+Pi+v^(2)+2U)+O(rho_(o)epsilon^(4))],[(39.42b)T^(0j)=rho_(o)(1+Pi+v^(2)+2U)v_(j)+t_( hat(j) hat(m))v_(m)+O(rho_(o)epsilon^(5))],[T^(jk)=t_(j hat(k))(1-2gamma U)+rho_(o)(1+Pi+v^(2)+2U)v_(j)v_(k)],[(39.42c)+(1)/(2)(v_(j)t_( hat(k) hat(m))v_(m)+v_(k)t_( hat(j) hat(m))v_(m))+O(rho_(o)epsilon^(6))]:}\begin{align*} T^{00}= & \rho_{o}\left(1+\Pi+v^{2}+2 U\right)+O\left(\rho_{o} \epsilon^{4}\right) \tag{39.42a}\\ T^{0 j}= & \rho_{o}\left(1+\Pi+v^{2}+2 U\right) v_{j}+t_{\hat{j} \hat{m}} v_{m}+O\left(\rho_{o} \epsilon^{5}\right) \tag{39.42b}\\ T^{j k}= & t_{j \hat{k}}(1-2 \gamma U)+\rho_{o}\left(1+\Pi+v^{2}+2 U\right) v_{j} v_{k} \\ & +\frac{1}{2}\left(v_{j} t_{\hat{k} \hat{m}} v_{m}+v_{k} t_{\hat{j} \hat{m}} v_{m}\right)+O\left(\rho_{o} \epsilon^{6}\right) \tag{39.42c} \end{align*}(39.42a)T00=ρo(1+Π+v2+2U)+O(ρoϵ4)(39.42b)T0j=ρo(1+Π+v2+2U)vj+tj^m^vm+O(ρoϵ5)Tjk=tjk^(12γU)+ρo(1+Π+v2+2U)vjvk(39.42c)+12(vjtk^m^vm+vktj^m^vm)+O(ρoϵ6)

Exercise 39.12. THE TRANSFORMATION BETWEEN COMOVING FRAME AND PPN FRAME

Carry out the details of the derivation of the transformation matrix (39.41); and in the process calculate the correction of O ( ϵ 4 ) O ϵ 4 O(epsilon^(4))O\left(\epsilon^{4}\right)O(ϵ4) to A 0 0 A 0 0 A^(0)_(0)A^{0}{ }_{0}A00.

§39.11. PPN EQUATIONS OF MOTION

The post-Newtonian corrections to the Newtonian equations of motion (39.15) and (39.16) are derived from the law of conservation of baryon number ( ρ o u α ) ; α = 0 ρ o u α ; α = 0 (rho_(o)u^(alpha))_(;alpha)=0\left(\rho_{o} u^{\alpha}\right)_{; \alpha}=0(ρouα);α=0, and from the law of conservation of local energy-momentum, T α β ; β = 0 T α β ; β = 0 T^(alpha beta)_(;beta)=0T^{\alpha \beta}{ }_{; \beta}=0Tαβ;β=0. The simplest of the equations of motion is the conservation of baryon number. Its exact expression is ( ρ o u α ) ; α = ( 1 / g ) ( g ρ o u α ) , α = 0 ρ o u α ; α = ( 1 / g ) g ρ o u α , α = 0 (rho_(o)u^(alpha))_(;alpha)=(1//sqrt(-g))(sqrt(-g)rho_(o)u^(alpha))_(,alpha)=0\left(\rho_{o} u^{\alpha}\right)_{; \alpha}=(1 / \sqrt{-g})\left(\sqrt{-g} \rho_{o} u^{\alpha}\right)_{, \alpha}=0(ρouα);α=(1/g)(gρouα),α=0. Define a new quantity
(39.43) ρ ρ o ( 1 + 1 2 v 2 + 3 γ U ) = ρ o u 0 g + O ( ρ o ϵ 4 ) (39.43) ρ ρ o 1 + 1 2 v 2 + 3 γ U = ρ o u 0 g + O ρ o ϵ 4 {:[(39.43)rho^(**)-=rho_(o)(1+(1)/(2)v^(2)+3gamma U)],[=rho_(o)u^(0)sqrt(-g)+O(rho_(o)epsilon^(4))]:}\begin{align*} \rho^{*} & \equiv \rho_{o}\left(1+\frac{1}{2} v^{2}+3 \gamma U\right) \tag{39.43}\\ & =\rho_{o} u^{0} \sqrt{-g}+O\left(\rho_{o} \epsilon^{4}\right) \end{align*}(39.43)ρρo(1+12v2+3γU)=ρou0g+O(ρoϵ4)
Law of energy conservation
Post-Newtonian Euler equation
[see (39.39) for u 0 u 0 u^(0)u^{0}u0, and (39.32) for the metric]. Then rest-mass conservation takes on the same form as at the Newtonian order (39.15a), except now it is more accurate:
(39.44) ρ , t + ( ρ v j ) , j = 0 + errors of O ( ρ o , j ϵ 5 ) (39.44) ρ , t + ρ v j , j = 0 +  errors of  O ρ o , j ϵ 5 {:(39.44)rho_(,t)^(**)+(rho^(**)v_(j))_(,j)=0+" errors of "O(rho_(o,j)epsilon^(5)):}\begin{equation*} \rho_{, t}^{*}+\left(\rho^{*} v_{j}\right)_{, j}=0+\text { errors of } O\left(\rho_{o, j} \epsilon^{5}\right) \tag{39.44} \end{equation*}(39.44)ρ,t+(ρvj),j=0+ errors of O(ρo,jϵ5)
The next simplest equation of motion is T ; α 0 α = 0 T ; α 0 α = 0 T_(;alpha)^(0alpha)=0T_{; \alpha}^{0 \alpha}=0T;α0α=0. Straightforward evaluation, using the metric of equations (39.32) and the stress-energy tensor of equations (39.42), yields
[ ρ o ( 1 + Π + v 2 + 2 U ) ] , t + [ ρ o ( 1 + Π + v 2 + 2 U ) v j + t j ^ m ^ v m ] , j (39.45) + ( 3 γ 2 ) ρ o U , t + ( 3 γ 3 ) ρ o v k U , k = O ( ρ o , j ϵ 5 ) . ρ o 1 + Π + v 2 + 2 U , t + ρ o 1 + Π + v 2 + 2 U v j + t j ^ m ^ v m , j (39.45) + ( 3 γ 2 ) ρ o U , t + ( 3 γ 3 ) ρ o v k U , k = O ρ o , j ϵ 5 . {:[[rho_(o)(1+Pi+v^(2)+2U)]_(,t)+[rho_(o)(1+Pi+v^(2)+2U)v_(j)+t_( hat(j) hat(m))v_(m)]_(,j)],[(39.45)+(3gamma-2)rho_(o)U_(,t)+(3gamma-3)rho_(o)v_(k)U_(,k)=O(rho_(o,j)epsilon^(5)).]:}\begin{align*} {\left[\rho_{o}\left(1+\Pi+v^{2}+2 U\right)\right]_{, t} } & +\left[\rho_{o}\left(1+\Pi+v^{2}+2 U\right) v_{j}+t_{\hat{j} \hat{m}} v_{m}\right]_{, j} \\ & +(3 \gamma-2) \rho_{o} U_{, t}+(3 \gamma-3) \rho_{o} v_{k} U_{, k}=O\left(\rho_{o, j} \epsilon^{5}\right) . \tag{39.45} \end{align*}[ρo(1+Π+v2+2U)],t+[ρo(1+Π+v2+2U)vj+tj^m^vm],j(39.45)+(3γ2)ρoU,t+(3γ3)ρovkU,k=O(ρo,jϵ5).
By subtracting equation (39.44) from this, and using the Newtonian equations of motion (39.15) and (39.16) to simplify several terms where the Newtonian approximation is adequate, one obtains
(39.46) ρ o d Π / d t + t j k ^ v j , k = 0 + errors of O ( ρ o , j ϵ 5 ) (39.46) ρ o d Π / d t + t j k ^ v j , k = 0 +  errors of  O ρ o , j ϵ 5 {:(39.46)rho_(o)d Pi//dt+t_(j hat(k))v_(j,k)=0+" errors of "O(rho_(o,j)epsilon^(5)):}\begin{equation*} \rho_{o} d \Pi / d t+t_{j \hat{k}} v_{j, k}=0+\text { errors of } O\left(\rho_{o, j} \epsilon^{5}\right) \tag{39.46} \end{equation*}(39.46)ρodΠ/dt+tjk^vj,k=0+ errors of O(ρo,jϵ5)
Notice that this is nothing but the first law of thermodynamics (local energy conservation) with energy flow through the matter being neglected. (Neglecting energy flow was justified in §39.5.) This first law of thermodynamics is actually a post-Newtonian equation in the context of hydrodynamics, rather than a Newtonian equation, because Π Π Pi\PiΠ does not affect the hydrodynamic motion at Newtonian order (see §39.7).
The last of the equations of motion, T j α ; α = 0 T j α ; α = 0 T^(j alpha)_(;alpha)=0T^{j \alpha}{ }_{; \alpha}=0Tjα;α=0, reduces to the post-Newtonian Euler equation
ρ d v j d t ρ U , j + [ t j ^ k ^ ( 1 + 3 γ U ) ] , k t j ^ k ^ , k ( 1 2 v 2 + Π ) t j ^ k t k ^ , ρ + ρ d d t [ ( 2 γ + 2 ) U v j 1 2 ( 7 Δ 1 + Δ 2 ) V j 1 2 α 1 U w j ] v j ρ U , t + v k t k ^ j ^ , t + 1 2 Δ 2 ρ ( V j W j ) , t + 1 2 ρ [ ( 7 Δ 1 + Δ 2 ) v k + ( α 1 2 α 3 ) w k ] V k , j (39.47) ρ [ 2 Ψ 1 2 ζ a 1 2 η D 1 2 α 2 w i w k U i k + α 2 w i ( V i W i ) ] , j ρ U , j [ γ v 2 1 2 α 1 w v + 1 2 ( α 2 + α 3 α 1 ) w 2 ( 2 β 2 ) U + 3 γ p / ρ ] + 1 2 ( v j , k t k ^ m ^ v m t j ^ m ^ v m , k v k ) + 1 2 [ v m ( t m ^ j ^ v k ) , k v j ( t k ^ ı ^ v k ) , ] = 0 . ρ d v j d t ρ U , j + t j ^ k ^ ( 1 + 3 γ U ) , k t j ^ k ^ , k 1 2 v 2 + Π t j ^ k t k ^ , ρ + ρ d d t ( 2 γ + 2 ) U v j 1 2 7 Δ 1 + Δ 2 V j 1 2 α 1 U w j v j ρ U , t + v k t k ^ j ^ , t + 1 2 Δ 2 ρ V j W j , t + 1 2 ρ 7 Δ 1 + Δ 2 v k + α 1 2 α 3 w k V k , j (39.47) ρ 2 Ψ 1 2 ζ a 1 2 η D 1 2 α 2 w i w k U i k + α 2 w i V i W i , j ρ U , j γ v 2 1 2 α 1 w v + 1 2 α 2 + α 3 α 1 w 2 ( 2 β 2 ) U + 3 γ p / ρ + 1 2 v j , k t k ^ m ^ v m t j ^ m ^ v m , k v k + 1 2 v m t m ^ j ^ v k , k v j t k ^ ı ^ v k , = 0 . {:[rho^(**)(dv_(j))/(dt)-rho^(**)U_(,j)+[t_( hat(j) hat(k))(1+3gamma U)]_(,k)-t_( hat(j) hat(k),k)((1)/(2)v^(2)+Pi)-(t_( hat(j)k)t_( hat(k)ℓ,ℓ))/(rho^(**))],[+rho^(**)(d)/(dt)[(2gamma+2)Uv_(j)-(1)/(2)(7Delta_(1)+Delta_(2))V_(j)-(1)/(2)alpha_(1)Uw_(j)]-v_(j)rho^(**)U_(,t)+v_(k)t_( hat(k) hat(j),t)],[+(1)/(2)Delta_(2)rho^(**)(V_(j)-W_(j))_(,t)+(1)/(2)rho^(**)[(7Delta_(1)+Delta_(2))v_(k)+(alpha_(1)-2alpha_(3))w_(k)]V_(k,j)],[(39.47)-rho^(**)[2Psi-(1)/(2)zeta a-(1)/(2)etaD-(1)/(2)alpha_(2)w_(i)w_(k)U_(ik)+alpha_(2)w_(i)(V_(i)-W_(i))]_(,j)],[-rho^(**)U_(,j)[gammav^(2)-(1)/(2)alpha_(1)w*v+(1)/(2)(alpha_(2)+alpha_(3)-alpha_(1))w^(2)-(2beta-2)U+3gamma p//rho^(**)]],[+(1)/(2)(v_(j,k)t_( hat(k) hat(m))v_(m)-t_( hat(j) hat(m))v_(m,k)v_(k))+(1)/(2)[v_(m)(t_( hat(m) hat(j))v_(k))_(,k)-v_(j)(t_( hat(k) hat(ı))v_(k))_(,ℓ)]=0.]:}\begin{align*} \rho^{*} \frac{d v_{j}}{d t} & -\rho^{*} U_{, j}+\left[t_{\hat{j} \hat{k}}(1+3 \gamma U)\right]_{, k}-t_{\hat{j} \hat{k}, k}\left(\frac{1}{2} v^{2}+\Pi\right)-\frac{t_{\hat{j} k} t_{\hat{k} \ell, \ell}}{\rho^{*}} \\ & +\rho^{*} \frac{d}{d t}\left[(2 \gamma+2) U v_{j}-\frac{1}{2}\left(7 \Delta_{1}+\Delta_{2}\right) V_{j}-\frac{1}{2} \alpha_{1} U w_{j}\right]-v_{j} \rho^{*} U_{, t}+v_{k} t_{\hat{k} \hat{j}, t} \\ & +\frac{1}{2} \Delta_{2} \rho^{*}\left(V_{j}-W_{j}\right)_{, t}+\frac{1}{2} \rho^{*}\left[\left(7 \Delta_{1}+\Delta_{2}\right) v_{k}+\left(\alpha_{1}-2 \alpha_{3}\right) w_{k}\right] V_{k, j} \\ & -\rho^{*}\left[2 \Psi-\frac{1}{2} \zeta a-\frac{1}{2} \eta \mathscr{D}-\frac{1}{2} \alpha_{2} w_{i} w_{k} U_{i k}+\alpha_{2} w_{i}\left(V_{i}-W_{i}\right)\right]_{, j} \tag{39.47}\\ & -\rho^{*} U_{, j}\left[\gamma v^{2}-\frac{1}{2} \alpha_{1} w \cdot v+\frac{1}{2}\left(\alpha_{2}+\alpha_{3}-\alpha_{1}\right) w^{2}-(2 \beta-2) U+3 \gamma p / \rho^{*}\right] \\ & +\frac{1}{2}\left(v_{j, k} t_{\hat{k} \hat{m}} v_{m}-t_{\hat{j} \hat{m}} v_{m, k} v_{k}\right)+\frac{1}{2}\left[v_{m}\left(t_{\hat{m} \hat{j}} v_{k}\right)_{, k}-v_{j}\left(t_{\hat{k} \hat{\imath}} v_{k}\right)_{, \ell}\right]=0 . \end{align*}ρdvjdtρU,j+[tj^k^(1+3γU)],ktj^k^,k(12v2+Π)tj^ktk^,ρ+ρddt[(2γ+2)Uvj12(7Δ1+Δ2)Vj12α1Uwj]vjρU,t+vktk^j^,t+12Δ2ρ(VjWj),t+12ρ[(7Δ1+Δ2)vk+(α12α3)wk]Vk,j(39.47)ρ[2Ψ12ζa12ηD12α2wiwkUik+α2wi(ViWi)],jρU,j[γv212α1wv+12(α2+α3α1)w2(2β2)U+3γp/ρ]+12(vj,ktk^m^vmtj^m^vm,kvk)+12[vm(tm^j^vk),kvj(tk^ı^vk),]=0.
Partial derivatives are denoted by commas; d / d t d / d t d//dtd / d td/dt is the time-derivative following the matter [equation (39.16)].
Equations (39.44), (39.46), and (39.47) are a complete set of equations of motion at the post-Newtonian order.

Exercise 39.13. EQUATIONS OF MOTION

EXERCISES
Carry out the details of the derivation of the equations of motion (39.44), (39.46), and (39.47). As part of the derivation, calculate the following values of the Christoffel symbols in the PPN coordinate frame:
Γ 0 00 = U , t + O ( U , j ϵ 3 ) , Γ 0 j 0 = U , j + O ( U , j ϵ 2 ) , Γ 0 j k = γ U , t δ j k + 7 2 Δ 1 V ( j , k ) + 1 2 Δ 2 W ( j , k ) + ( 1 2 α 1 α 2 ) w ( j U , k ) + α 2 w i U i ( j , k ) + O ( U , j ϵ 3 ) . Γ j 00 = U , j + [ ( β + γ ) U 2 2 Ψ + 1 2 ζ a + 1 2 η η D + 1 2 ( α 1 α 2 α 3 ) w 2 U (39.48) + 1 2 ( α 1 2 α 3 ) w i V i + 1 2 α 2 w i w k U i k ] , j 7 2 Δ 1 V j , t 1 2 Δ 2 W j , t + ( α 2 1 2 α 1 ) w j U , t α 2 w i U i j , t + O ( U , j ϵ 4 ) , Γ j 0 k = γ U , t δ j k ( 7 2 Δ 1 + 1 2 Δ 2 ) V [ j , k ] 1 2 α 1 w [ j U , k ] + O ( U , j ϵ 3 ) , Γ j k = γ ( U , j δ k 2 U , ( k δ j ) + O ( U , j ϵ 2 ) . Γ 0 00 = U , t + O U , j ϵ 3 , Γ 0 j 0 = U , j + O U , j ϵ 2 , Γ 0 j k = γ U , t δ j k + 7 2 Δ 1 V ( j , k ) + 1 2 Δ 2 W ( j , k ) + 1 2 α 1 α 2 w ( j U , k ) + α 2 w i U i ( j , k ) + O U , j ϵ 3 . Γ j 00 = U , j + ( β + γ ) U 2 2 Ψ + 1 2 ζ a + 1 2 η η D + 1 2 α 1 α 2 α 3 w 2 U (39.48) + 1 2 α 1 2 α 3 w i V i + 1 2 α 2 w i w k U i k , j 7 2 Δ 1 V j , t 1 2 Δ 2 W j , t + α 2 1 2 α 1 w j U , t α 2 w i U i j , t + O U , j ϵ 4 , Γ j 0 k = γ U , t δ j k 7 2 Δ 1 + 1 2 Δ 2 V [ j , k ] 1 2 α 1 w [ j U , k ] + O U , j ϵ 3 , Γ j k = γ U , j δ k 2 U , ( k δ j + O U , j ϵ 2 . {:[Gamma^(0)_(00)=-U_(,t)+O(U_(,j)epsilon^(3))","quadGamma_(0j)^(0)=-U_(,j)+O(U_(,j)epsilon^(2))","],[Gamma^(0)_(jk)=gammaU_(,t)delta_(jk)+(7)/(2)Delta_(1)V_((j,k))+(1)/(2)Delta_(2)W_((j,k))+((1)/(2)alpha_(1)-alpha_(2))w_((j)U_(,k))],[+alpha_(2)w_(i)U_(i(j,k))+O(U_(,j)epsilon^(3)).],[Gamma^(j)_(00)=-U_(,j)+[(beta+gamma)U^(2)-2Psi+(1)/(2)zeta a+(1)/(2)etaeta^(D)+(1)/(2)(alpha_(1)-alpha_(2)-alpha_(3))w^(2)U:}],[(39.48)+(1)/(2)(alpha_(1)-2alpha_(3))w_(i)V_(i)+(1)/(2)alpha_(2)w_(i)w_(k)U_(ik)]_(,j)-(7)/(2)Delta_(1)V_(j,t)-(1)/(2)Delta_(2)W_(j,t)],[+(alpha_(2)-(1)/(2)alpha_(1))w_(j)U_(,t)-alpha_(2)w_(i)U_(ij,t)+O(U_(,j)epsilon^(4))","],[Gamma^(j)_(0k)=gammaU_(,t)delta_(jk)-((7)/(2)Delta_(1)+(1)/(2)Delta_(2))V_([j,k])-(1)/(2)alpha_(1)w_([j)U_(,k])+O(U_(,j)epsilon^(3))","],[Gamma^(j)_(kℓ)=-gamma(U_(,j)delta_(kℓ)-2U_(,(k)delta_(ℓj))+O(U_(,j)epsilon^(2)).]:}\begin{align*} \Gamma^{0}{ }_{00}= & -U_{, t}+O\left(U_{, j} \epsilon^{3}\right), \quad \Gamma_{0 j}^{0}=-U_{, j}+O\left(U_{, j} \epsilon^{2}\right), \\ \Gamma^{0}{ }_{j k}= & \gamma U_{, t} \delta_{j k}+\frac{7}{2} \Delta_{1} V_{(j, k)}+\frac{1}{2} \Delta_{2} W_{(j, k)}+\left(\frac{1}{2} \alpha_{1}-\alpha_{2}\right) w_{(j} U_{, k)} \\ & +\alpha_{2} w_{i} U_{i(j, k)}+O\left(U_{, j} \epsilon^{3}\right) . \\ \Gamma^{j}{ }_{00}= & -U_{, j}+\left[(\beta+\gamma) U^{2}-2 \Psi+\frac{1}{2} \zeta a+\frac{1}{2} \eta \eta^{\mathscr{D}}+\frac{1}{2}\left(\alpha_{1}-\alpha_{2}-\alpha_{3}\right) w^{2} U\right. \\ & \left.+\frac{1}{2}\left(\alpha_{1}-2 \alpha_{3}\right) w_{i} V_{i}+\frac{1}{2} \alpha_{2} w_{i} w_{k} U_{i k}\right]_{, j}-\frac{7}{2} \Delta_{1} V_{j, t}-\frac{1}{2} \Delta_{2} W_{j, t} \tag{39.48}\\ & +\left(\alpha_{2}-\frac{1}{2} \alpha_{1}\right) w_{j} U_{, t}-\alpha_{2} w_{i} U_{i j, t}+O\left(U_{, j} \epsilon^{4}\right), \\ \Gamma^{j}{ }_{0 k}= & \gamma U_{, t} \delta_{j k}-\left(\frac{7}{2} \Delta_{1}+\frac{1}{2} \Delta_{2}\right) V_{[j, k]}-\frac{1}{2} \alpha_{1} w_{[j} U_{, k]}+O\left(U_{, j} \epsilon^{3}\right), \\ \Gamma^{j}{ }_{k \ell}= & -\gamma\left(U_{, j} \delta_{k \ell}-2 U_{,(k} \delta_{\ell j}\right)+O\left(U_{, j} \epsilon^{2}\right) . \end{align*}Γ000=U,t+O(U,jϵ3),Γ0j0=U,j+O(U,jϵ2),Γ0jk=γU,tδjk+72Δ1V(j,k)+12Δ2W(j,k)+(12α1α2)w(jU,k)+α2wiUi(j,k)+O(U,jϵ3).Γj00=U,j+[(β+γ)U22Ψ+12ζa+12ηηD+12(α1α2α3)w2U(39.48)+12(α12α3)wiVi+12α2wiwkUik],j72Δ1Vj,t12Δ2Wj,t+(α212α1)wjU,tα2wiUij,t+O(U,jϵ4),Γj0k=γU,tδjk(72Δ1+12Δ2)V[j,k]12α1w[jU,k]+O(U,jϵ3),Γjk=γ(U,jδk2U,(kδj)+O(U,jϵ2).
Here square brackets on tensor indices denote antisymmetrization, and round brackets denote symmetrization. As part of the derivation, it may be useful to prove and use the relations
(39.49a) χ ( t , x ) = ρ o ( t , x ) | x x | d 3 x (39.49b) χ , j k = δ j k U + U j k (39.49c) χ , i t = V i W i + O ( ϵ 5 ) (39.50) W [ k , j ] = V [ k , j ] . (39.49a) χ ( t , x ) = ρ o t , x x x d 3 x (39.49b) χ , j k = δ j k U + U j k (39.49c) χ , i t = V i W i + O ϵ 5 (39.50) W [ k , j ] = V [ k , j ] . {:[(39.49a)chi(t","x)=-intrho_(o)(t,x^('))|x-x^(')|d^(3)x^(')],[(39.49b)chi_(,jk)=-delta_(jk)U+U_(jk)],[(39.49c)chi_(,it)=V_(i)-W_(i)+O(epsilon^(5))],[(39.50)W_([k,j])=V_([k,j]).]:}\begin{align*} \chi(t, x) & =-\int \rho_{o}\left(t, x^{\prime}\right)\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right| d^{3} x^{\prime} \tag{39.49a}\\ \chi_{, j k} & =-\delta_{j k} U+U_{j k} \tag{39.49b}\\ \chi_{, i t} & =V_{i}-W_{i}+O\left(\epsilon^{5}\right) \tag{39.49c}\\ W_{[k, j]} & =V_{[k, j]} . \tag{39.50} \end{align*}(39.49a)χ(t,x)=ρo(t,x)|xx|d3x(39.49b)χ,jk=δjkU+Ujk(39.49c)χ,it=ViWi+O(ϵ5)(39.50)W[k,j]=V[k,j].
Here χ χ chi\chiχ is the function originally defined in equation (39.29c).
Exercise 39.14. POST-NEWTONIAN APPROXIMATION TO GENERAL RELATIVITY
Perform a post-Newtonian expansion of Einstein's field equations, thereby obtaining the values cited in Box 39.2 for the PPN parameters of general relativity. The calculations might best follow the approach of Chandrasekhar (1965a): Set g α β = η α β + h α β g α β = η α β + h α β g_(alpha beta)=eta_(alpha beta)+h_(alpha beta)g_{\alpha \beta}=\eta_{\alpha \beta}+h_{\alpha \beta}gαβ=ηαβ+hαβ, and assume
(39.51) h 00 = O ( ϵ 2 ) + O ( ϵ 4 ) , h 0 j = O ( ϵ 3 ) , h j k = O ( ϵ 2 ) (39.51) h 00 = O ϵ 2 + O ϵ 4 , h 0 j = O ϵ 3 , h j k = O ϵ 2 {:(39.51)h_(00)=O(epsilon^(2))+O(epsilon^(4))","quadh_(0j)=O(epsilon^(3))","quadh_(jk)=O(epsilon^(2)):}\begin{equation*} h_{00}=O\left(\epsilon^{2}\right)+O\left(\epsilon^{4}\right), \quad h_{0 j}=O\left(\epsilon^{3}\right), \quad h_{j k}=O\left(\epsilon^{2}\right) \tag{39.51} \end{equation*}(39.51)h00=O(ϵ2)+O(ϵ4),h0j=O(ϵ3),hjk=O(ϵ2)
Choose the space and time coordinates so that the four "gauge conditions"
(39.52) h j k , k 1 2 h , j = O ( ϵ 4 / R ) h 0 k , k 1 2 h k k , 0 = O ( ϵ 5 / R ) } with h = h α β η α β = h 00 + h l (39.52) h j k , k 1 2 h , j = O ϵ 4 / R h 0 k , k 1 2 h k k , 0 = O ϵ 5 / R  with  h = h α β η α β = h 00 + h l {:(39.52){:[h_(jk,k)-(1)/(2)h_(,j)=O(epsilon^(4)//R_(o.))],[h_(0k,k)-(1)/(2)h_(kk,0)=O(epsilon^(5)//R_(o.))]}" with "h=h_(alpha beta)eta^(alpha beta)=-h_(00)+h_(lℓ):}\left.\begin{array}{l} h_{j k, k}-\frac{1}{2} h_{, j}=O\left(\epsilon^{4} / R_{\odot}\right) \tag{39.52}\\ h_{0 k, k}-\frac{1}{2} h_{k k, 0}=O\left(\epsilon^{5} / R_{\odot}\right) \end{array}\right\} \text { with } h=h_{\alpha \beta} \eta^{\alpha \beta}=-h_{00}+h_{l \ell}(39.52)hjk,k12h,j=O(ϵ4/R)h0k,k12hkk,0=O(ϵ5/R)} with h=hαβηαβ=h00+hl
are satisfied.
(a) Show that the spatial gauge conditions are the post-Newtonian approximations to those (35.1a) used in the study of weak gravitational waves, but that the temporal gauge condition is not.
(b) Use these gauge conditions and the post-Newtonian limit in equations (8.24) and (8.47) to obtain for the Ricci tensor, accurate to linearized order,
(39.53a) R 00 = 1 2 h 00 , m m + O ( ϵ 4 / R 2 ) , R j k = 1 2 h j k , m m + O ( ϵ 4 / R 2 ) (39.53b) R 0 j = 1 2 h 0 j , m m 1 4 h 00 , 0 j + O ( ϵ 5 / R 2 ) (39.53a) R 00 = 1 2 h 00 , m m + O ϵ 4 / R 2 , R j k = 1 2 h j k , m m + O ϵ 4 / R 2 (39.53b) R 0 j = 1 2 h 0 j , m m 1 4 h 00 , 0 j + O ϵ 5 / R 2 {:[(39.53a)R_(00)=-(1)/(2)h_(00,mm)+O(epsilon^(4)//R_(o.)^(2))","quadR_(jk)=-(1)/(2)h_(jk,mm)+O(epsilon^(4)//R_(o.)^(2))],[(39.53b)R_(0j)=-(1)/(2)h_(0j,mm)-(1)/(4)h_(00,0j)+O(epsilon^(5)//R_(o.)^(2))]:}\begin{gather*} R_{00}=-\frac{1}{2} h_{00, m m}+O\left(\epsilon^{4} / R_{\odot}^{2}\right), \quad R_{j k}=-\frac{1}{2} h_{j k, m m}+O\left(\epsilon^{4} / R_{\odot}^{2}\right) \tag{39.53a}\\ R_{0 j}=-\frac{1}{2} h_{0 j, m m}-\frac{1}{4} h_{00,0 j}+O\left(\epsilon^{5} / R_{\odot}^{2}\right) \tag{39.53b} \end{gather*}(39.53a)R00=12h00,mm+O(ϵ4/R2),Rjk=12hjk,mm+O(ϵ4/R2)(39.53b)R0j=12h0j,mm14h00,0j+O(ϵ5/R2)
(c) Combine these with the Newtonian form (39.13) of the stress-energy tensor, and with equation (39.27), to obtain the following metric coefficients, accurate to linearized order:
(39.54) h 00 = 2 U + k 00 + O ( ϵ 6 ) , h 0 j = 7 2 V j 1 2 W j + O ( ϵ 5 ) , [unknown post-Newtonian correction] h j k = 2 U δ j k + O ( ϵ 4 ) . (39.54) h 00 = 2 U + k 00 + O ϵ 6 , h 0 j = 7 2 V j 1 2 W j + O ϵ 5 ,  [unknown post-Newtonian correction]  h j k = 2 U δ j k + O ϵ 4 . {:[(39.54)h_(00)=2U+k_(00)+O(epsilon^(6))","quadh_(0j)=-(7)/(2)V_(j)-(1)/(2)W_(j)+O(epsilon^(5))","],[" [unknown post-Newtonian correction] "],[h_(jk)=2Udelta_(jk)+O(epsilon^(4)).]:}\begin{gather*} h_{00}=2 U+k_{00}+O\left(\epsilon^{6}\right), \quad h_{0 j}=-\frac{7}{2} V_{j}-\frac{1}{2} W_{j}+O\left(\epsilon^{5}\right), \tag{39.54}\\ \text { [unknown post-Newtonian correction] } \\ h_{j k}=2 U \delta_{j k}+O\left(\epsilon^{4}\right) . \end{gather*}(39.54)h00=2U+k00+O(ϵ6),h0j=72Vj12Wj+O(ϵ5), [unknown post-Newtonian correction] hjk=2Uδjk+O(ϵ4).
Here U , V j U , V j U,V_(j)U, V_{j}U,Vj, and W j W j W_(j)W_{j}Wj are to be regarded as defined by equations (39.34a,b,c). By comparing these metric coefficients with equations (39.32), discover that
(39.55) γ = 1 , Δ 1 = 1 , Δ 2 = 1 (39.55) γ = 1 , Δ 1 = 1 , Δ 2 = 1 {:(39.55)gamma=1","quadDelta_(1)=1","quadDelta_(2)=1:}\begin{equation*} \gamma=1, \quad \Delta_{1}=1, \quad \Delta_{2}=1 \tag{39.55} \end{equation*}(39.55)γ=1,Δ1=1,Δ2=1
for general relativity.
(d) With this knowledge of the metric in linearized order, one can carry out the analysis of § 39.10 § 39.10 §39.10\S 39.10§39.10 (using γ = Δ 1 = Δ 2 = 1 γ = Δ 1 = Δ 2 = 1 gamma=Delta_(1)=Delta_(2)=1\gamma=\Delta_{1}=\Delta_{2}=1γ=Δ1=Δ2=1 throughout), to obtain the post-Newtonian corrections to the stress-energy tensor [equation (39.42) with γ = 1 γ = 1 gamma=1\gamma=1γ=1 ].
(e) Calculate, similarly, the post-Newtonian corrections to the Ricci tensor component R 00 R 00 R_(00)R_{00}R00, using g α β = η α β + h α β g α β = η α β + h α β g_(alpha beta)=eta_(alpha beta)+h_(alpha beta)g_{\alpha \beta}=\eta_{\alpha \beta}+h_{\alpha \beta}gαβ=ηαβ+hαβ, using h α β h α β h_(alpha beta)h_{\alpha \beta}hαβ as given in equations (39.54), and using the gauge conditions (39.52). The answer should be
(39.56) R 00 = ( U 1 2 k 00 U 2 ) , m m + 4 U U , m m + O ( ϵ 6 / R 2 ) (39.56) R 00 = U 1 2 k 00 U 2 , m m + 4 U U , m m + O ϵ 6 / R 2 {:(39.56)R_(00)=(-U-(1)/(2)k_(00)-U^(2))_(,mm)+4UU_(,mm)+O(epsilon^(6)//R_(o.)^(2)):}\begin{equation*} R_{00}=\left(-U-\frac{1}{2} k_{00}-U^{2}\right)_{, m m}+4 U U_{, m m}+O\left(\epsilon^{6} / R_{\odot}^{2}\right) \tag{39.56} \end{equation*}(39.56)R00=(U12k00U2),mm+4UU,mm+O(ϵ6/R2)
(f) Evaluate the Einstein equation R 00 = 8 π ( T 00 1 2 g 00 T ) R 00 = 8 π T 00 1 2 g 00 T R_(00)=8pi(T_(00)-(1)/(2)g_(00)T)R_{00}=8 \pi\left(T_{00}-\frac{1}{2} g_{00} T\right)R00=8π(T0012g00T), accurate to post-Newtonian order, and solve it to obtain the post-Newtonian metric correction
(39.57) k 00 = 2 U 2 + 4 Ψ (39.57) k 00 = 2 U 2 + 4 Ψ {:(39.57)k_(00)=-2U^(2)+4Psi:}\begin{equation*} k_{00}=-2 U^{2}+4 \Psi \tag{39.57} \end{equation*}(39.57)k00=2U2+4Ψ
where Ψ Ψ Psi\PsiΨ is given by equations (39.34d) with β 1 = β 2 = β 3 = β 4 = 1 β 1 = β 2 = β 3 = β 4 = 1 beta_(1)=beta_(2)=beta_(3)=beta_(4)=1\beta_{1}=\beta_{2}=\beta_{3}=\beta_{4}=1β1=β2=β3=β4=1. By comparing with equations (39.32c) and (39.34d), discover that
(39.58) β = β 1 = β 2 = β 3 = β 4 = 1 , ζ = η = 0 (39.58) β = β 1 = β 2 = β 3 = β 4 = 1 , ζ = η = 0 {:(39.58)beta=beta_(1)=beta_(2)=beta_(3)=beta_(4)=1","quad zeta=eta=0:}\begin{equation*} \beta=\beta_{1}=\beta_{2}=\beta_{3}=\beta_{4}=1, \quad \zeta=\eta=0 \tag{39.58} \end{equation*}(39.58)β=β1=β2=β3=β4=1,ζ=η=0
for general relativity.
(g) Knowing the full post-Newtonian metric, and the full post-Newtonian stress-energy tensor, one can carry out the calculations of § 39.11 § 39.11 §39.11\S 39.11§39.11 (using γ = β = β 1 = β 2 = β 3 = β 4 = γ = β = β 1 = β 2 = β 3 = β 4 = gamma=beta=beta_(1)=beta_(2)=beta_(3)=beta_(4)=\gamma=\beta=\beta_{1}=\beta_{2}=\beta_{3}=\beta_{4}=γ=β=β1=β2=β3=β4= Δ 1 = Δ 2 = 1 , ζ = η = 0 Δ 1 = Δ 2 = 1 , ζ = η = 0 Delta_(1)=Delta_(2)=1,zeta=eta=0\Delta_{1}=\Delta_{2}=1, \zeta=\eta=0Δ1=Δ2=1,ζ=η=0 ) to obtain the post-Newtonian equations of motion for the matter [equations (39.44), (39.46), and (39.47)].

§39.12. RELATION OF PPN COORDINATES TO SURROUNDING UNIVERSE

One crucial issue remains to be clarified: What is the orientation of the PPN coordinate system relative to the surrounding universe? More particularly: Does the PPN coordinate system rotate relative to the "fixed stars on the sky;" or is it "rigidly attached" to them, in some sense? In order to answer this question, imagine using the PPN formalism to analyze the solar system. Make no assumptions about the solar system's velocity through the PPN coordinate frame. Then, as one moves outward from the Sun, past the Earth's orbit, past Pluto's orbit, and on out toward interstellar space, one sees the PPN coordinate frame become more and more Lorentz in its global properties [ g α β = η α β + O ( M / r ) g α β = η α β + O M / r [g_(alpha beta)=eta_(alpha beta)+O(M_(o.)//r):}\left[g_{\alpha \beta}=\eta_{\alpha \beta}+O\left(M_{\odot} / r\right)\right.[gαβ=ηαβ+O(M/r) ]. Thus, far from the solar system the PPN coordinates become a "Lorentz frame moving through the galaxy." This means, of course, that the spatial axes of the PPN coordinate frame behave as though they were attached to gyroscopes far outside the solar system. Equivalently: The PPN coordinate system Fermi-Walker-transports its spatial axes through the spacetime geometry of the galaxy and universe.

§39.13. SUMMARY OF PPN FORMALISM

The PPN formalism, as constructed in this chapter, is summarized in Box 39.4. Much of the recent literature uses a different set of PPN parameters than are used in this book; for a translation from one parameter set to the other, see Box 39.5.

Exercise 39.15. MANY-BODY SYSTEM IN POST-NEWTONIAN LIMIT OF GENERAL RELATIVITY

Consider, in the post-Newtonian limit of general relativity, a system made up of many gravitationally interacting bodies with separations large compared to their sizes (example: the solar system). Idealize each body to be spherically symmetric, to be free of internal motions, and to have isotropic internal stresses, t j k = δ j k p t j k = δ j k p t_(jk)=delta_(jk)pt_{j k}=\delta_{j k} ptjk=δjkp. Let the world line of the center of body A A AAA, in some chosen PPN coordinate frame, be x A ( t ) x A ( t ) x_(A)(t)\boldsymbol{x}_{A}(t)xA(t); and let the (coordinate) velocity of the center of body A A AAA be
(39.59a) v A ( t ) = d x A / d t . (39.59a) v A ( t ) = d x A / d t . {:(39.59a)v_(A)(t)=dx_(A)//dt.:}\begin{equation*} v_{\boldsymbol{A}}(t)=d x_{A} / d t . \tag{39.59a} \end{equation*}(39.59a)vA(t)=dxA/dt.
The total mass-energy of body A A AAA as measured in its neighborhood (rest mass-energy plus internal energy plus self-gravitational energy) is given by
(39.59b) M A = V A ( 1 + Π 1 2 U self ) d ( rest mass ) + O ( M A ϵ 4 ) (39.59b) M A = V A 1 + Π 1 2 U self  d (  rest mass  ) + O M A ϵ 4 {:(39.59b)M_(A)=int_(V_(A))(1+Pi-(1)/(2)U_("self "))d(" rest mass ")+O(M_(A)epsilon^(4)):}\begin{equation*} M_{A}=\int_{V_{A}}\left(1+\Pi-\frac{1}{2} U_{\text {self }}\right) d(\text { rest mass })+O\left(M_{A} \epsilon^{4}\right) \tag{39.59b} \end{equation*}(39.59b)MA=VA(1+Π12Uself )d( rest mass )+O(MAϵ4)
where U self U self  U_("self ")U_{\text {self }}Uself  is the body's own Newtonian potential (no contributions from other bodies), and V A V A V_(A)\mathscr{V}_{A}VA is the interior of the body.
Solar system's PPN coordinate frame is attached to a local Lorentz frame of Galaxy

Box 39.4 SUMMARY OF THE PPN FORMALISM

I. Variables

ρ o ( x , t ) ρ o ( x , t ) rho_(o)(x,t)\rho_{o}(\boldsymbol{x}, t)ρo(x,t) : baryon "mass" density ( § 39.3 ) § 39.3 {:§39.3)\left.\S 39.3\right)§39.3), as measured in rest frame
Π ( x , t ) Π ( x , t ) Pi(x,t)\Pi(x, t)Π(x,t) : specific internal energy (dimensionless; § 39.3 § 39.3 §39.3\S 39.3§39.3 ), as measured in rest frame t j k ^ ( x , t ) t j k ^ ( x , t ) t_(j hat(k))(x,t)t_{j \hat{k}}(x, t)tjk^(x,t) : components of stress referred to orthonormal axes of rest frame v j ( x , t ) v j ( x , t ) v_(j)(x,t)v_{j}(x, t)vj(x,t) : coordinate velocity of matter (i.e., rest frame) relative to PPN coordinates U ( x , t ) , Ψ ( x , t ) , A ( x , t ) , D ( x , t ) , V j ( x , t ) , W j ( x , t ) , U j k ( x , t ) U ( x , t ) , Ψ ( x , t ) , A ( x , t ) , D ( x , t ) , V j ( x , t ) , W j ( x , t ) , U j k ( x , t ) U(x,t),Psi(x,t),A(x,t),D(x,t),V_(j)(x,t),W_(j)(x,t),U_(jk)(x,t)U(x, t), \Psi(x, t), \mathscr{A}(x, t), \mathscr{D}(x, t), V_{j}(x, t), W_{j}(x, t), U_{j k}(x, t)U(x,t),Ψ(x,t),A(x,t),D(x,t),Vj(x,t),Wj(x,t),Ujk(x,t) : gravitational potentials γ , β , β 1 , β 2 , β 3 , β 4 , Δ 1 , Δ 2 , ζ , η γ , β , β 1 , β 2 , β 3 , β 4 , Δ 1 , Δ 2 , ζ , η gamma,beta,beta_(1),beta_(2),beta_(3),beta_(4),Delta_(1),Delta_(2),zeta,eta\gamma, \beta, \beta_{1}, \beta_{2}, \beta_{3}, \beta_{4}, \Delta_{1}, \Delta_{2}, \zeta, \etaγ,β,β1,β2,β3,β4,Δ1,Δ2,ζ,η : parameters whose values distinguish one theory from another (see Box 39.2)
w w w\boldsymbol{w}w : velocity of PPN coordinate frame relative to "universal rest frame" [relevant only for theories with nonzero α 1 , α 2 α 1 , α 2 alpha_(1),alpha_(2)\alpha_{1}, \alpha_{2}α1,α2, or α 3 α 3 alpha_(3)\alpha_{3}α3; see eq. (39.33)].
II. Equations governing evolution of these variables
ρ o ρ o rho_(o)\rho_{o}ρo : conservation of rest mass, equation (39.44)
Π Π Pi\PiΠ : first law of thermodynamics, equation (39.46)
t j k t j k t_(jk)t_{j k}tjk : determined in terms of ρ o , Π ρ o , Π rho_(o),Pi\rho_{o}, \Piρo,Π, and other material variables (chemical composition, strains, etc.) by equations of state and the usual theory of a stressed medium-which is not discussed here
v j v j v_(j)v_{j}vj : equations of motion (" F = m a F = m a F=ma\boldsymbol{F}=\boldsymbol{m} \boldsymbol{a}F=ma "), equations (39.47)
U , Ψ , A , D , V j , W j , U j k U , Ψ , A , D , V j , W j , U j k U,Psi,A,D,V_(j),W_(j),U_(jk)U, \Psi, \mathscr{A}, \mathscr{D}, V_{j}, W_{j}, U_{j k}U,Ψ,A,D,Vj,Wj,Ujk : source equations (39.34)
III. Quantities to be calculated from these variables
g 00 ( x , t ) , g 0 j ( x , t ) , g j k ( x , t ) g 00 ( x , t ) , g 0 j ( x , t ) , g j k ( x , t ) g_(00)(x,t),g_(0j)(x,t),g_(jk)(x,t)g_{00}(x, t), g_{0 j}(x, t), g_{j k}(x, t)g00(x,t),g0j(x,t),gjk(x,t) : these components of metric in PPN coordinate frame are expressed in terms of gravitational potentials by equations (39.32)
u 0 ( x , t ) , u j ( x , t ) u 0 ( x , t ) , u j ( x , t ) u^(0)(x,t),u^(j)(x,t)u^{0}(\boldsymbol{x}, t), u^{j}(\boldsymbol{x}, t)u0(x,t),uj(x,t) : these components of matter 4 -velocity in PPN coordinate frame are given by equations (39.39)
T 00 ( x , t ) , T 0 j ( x , t ) , T j k ( x , t ) T 00 ( x , t ) , T 0 j ( x , t ) , T j k ( x , t ) T^(00)(x,t),T^(0j)(x,t),T^(jk)(x,t)T^{00}(x, t), T^{0 j}(x, t), T^{j k}(x, t)T00(x,t),T0j(x,t),Tjk(x,t) : these components of stress-energy tensor in PPN coordinate frame are given by equations (39.42)
IV. Relation between rest frame, PPN coordinates, and the universe
  1. Orthonormal basis ω α ^ ω α ^ omega^( hat(alpha))\boldsymbol{\omega}^{\hat{\alpha}}ωα^ of rest frame, where t j k ^ t j k ^ t_(j hat(k))t_{j \hat{k}}tjk^ are defined, is related to PPN coordinate basis d x α d x α dx^(alpha)\boldsymbol{d} x^{\alpha}dxα by equations (39.41)
  2. Far from the sun, the PPN coordinates become asymptotically Lorentz; i.e., they form an inertial frame moving through the spacetime geometry of the galaxy and the universe.
  3. Gives no account of expansion of universe or of cosmic gravitational waves impinging on solar system.

Box 39.5 PPN PARAMETERS USED IN LITERATURE: A TRANSLATOR'S GUIDE

The original "point-particle version" of the PPN formalism [Nordtvedt (1968b)], and the original "perfect-fluid version" [Will (1971c)] used different sets of PPN parameters. This book has adopted Will's set, and has added the parameter η η eta\etaη characterizing effects of anisotropic stresses. More recently, Will and Nordtvedt have jointly adopted a revised set of parameters, described below.
A. Translation Table
Will-Nordtvedt
revised parameters
Will-Nordtvedt revised parameters| Will-Nordtvedt | | :--- | | revised parameters |
Revised parameters in
notation of this book
Revised parameters in notation of this book| Revised parameters in | | :--- | | notation of this book |
Revised parameters in
notation of Nordtvedt ( 1968b ) c (  1968b  ) c (" 1968b ")^(c)(\text { 1968b })^{\mathrm{c}}( 1968b )c
Revised parameters in notation of Nordtvedt (" 1968b ")^(c)| Revised parameters in | | :--- | | notation of Nordtvedt $(\text { 1968b })^{\mathrm{c}}$ |
γ γ gamma\gammaγ γ γ gamma\gammaγ γ γ gamma\gammaγ
β β beta\betaβ β β beta\betaβ β β beta\betaβ
α 1 α 1 alpha_(1)\alpha_{1}α1 7 Δ 1 + Δ 2 4 γ 4 7 Δ 1 + Δ 2 4 γ 4 7Delta_(1)+Delta_(2)-4gamma-47 \Delta_{1}+\Delta_{2}-4 \gamma-47Δ1+Δ24γ4 8 Δ 4 γ 4 8 Δ 4 γ 4 8Delta-4gamma-48 \Delta-4 \gamma-48Δ4γ4
α 2 α 2 alpha_(2)\alpha_{2}α2 Δ 2 + ζ 1 Δ 2 + ζ 1 Delta_(2)+zeta-1\Delta_{2}+\zeta-1Δ2+ζ1 α 1 α 1 alpha^(''')-1\alpha^{\prime \prime \prime}-1α1
α 3 α 3 alpha_(3)\alpha_{3}α3 4 β 1 2 γ 2 ζ 4 β 1 2 γ 2 ζ 4beta_(1)-2gamma-2-zeta4 \beta_{1}-2 \gamma-2-\zeta4β12γ2ζ 4 α α 2 γ 1 4 α α 2 γ 1 4alpha^('')-alpha^(''')-2gamma-14 \alpha^{\prime \prime}-\alpha^{\prime \prime \prime}-2 \gamma-14αα2γ1
ζ 1 ζ 1 zeta_(1)\zeta_{1}ζ1 ζ ζ zeta\zetaζ α χ α χ alpha^(''')-chi\alpha^{\prime \prime \prime}-\chiαχ
ζ 2 ζ 2 zeta_(2)\zeta_{2}ζ2 2 β + 2 β 2 3 γ 1 2 β + 2 β 2 3 γ 1 2beta+2beta_(2)-3gamma-12 \beta+2 \beta_{2}-3 \gamma-12β+2β23γ1 2 β α 1 2 β α 1 2beta-alpha^(')-12 \beta-\alpha^{\prime}-12βα1
ζ 3 ζ 3 zeta_(3)\zeta_{3}ζ3 β 3 1 β 3 1 beta_(3)-1\beta_{3}-1β31 absent
ζ 4 ζ 4 zeta_(4)\zeta_{4}ζ4 β 4 γ β 4 γ beta_(4)-gamma\beta_{4}-\gammaβ4γ absent
"Will-Nordtvedt revised parameters" "Revised parameters in notation of this book" "Revised parameters in notation of Nordtvedt (" 1968b ")^(c)" gamma gamma gamma beta beta beta alpha_(1) 7Delta_(1)+Delta_(2)-4gamma-4 8Delta-4gamma-4 alpha_(2) Delta_(2)+zeta-1 alpha^(''')-1 alpha_(3) 4beta_(1)-2gamma-2-zeta 4alpha^('')-alpha^(''')-2gamma-1 zeta_(1) zeta alpha^(''')-chi zeta_(2) 2beta+2beta_(2)-3gamma-1 2beta-alpha^(')-1 zeta_(3) beta_(3)-1 absent zeta_(4) beta_(4)-gamma absent| Will-Nordtvedt <br> revised parameters | Revised parameters in <br> notation of this book | Revised parameters in <br> notation of Nordtvedt $(\text { 1968b })^{\mathrm{c}}$ | | :---: | :---: | :---: | | $\gamma$ | $\gamma$ | $\gamma$ | | $\beta$ | $\beta$ | $\beta$ | | $\alpha_{1}$ | $7 \Delta_{1}+\Delta_{2}-4 \gamma-4$ | $8 \Delta-4 \gamma-4$ | | $\alpha_{2}$ | $\Delta_{2}+\zeta-1$ | $\alpha^{\prime \prime \prime}-1$ | | $\alpha_{3}$ | $4 \beta_{1}-2 \gamma-2-\zeta$ | $4 \alpha^{\prime \prime}-\alpha^{\prime \prime \prime}-2 \gamma-1$ | | $\zeta_{1}$ | $\zeta$ | $\alpha^{\prime \prime \prime}-\chi$ | | $\zeta_{2}$ | $2 \beta+2 \beta_{2}-3 \gamma-1$ | $2 \beta-\alpha^{\prime}-1$ | | $\zeta_{3}$ | $\beta_{3}-1$ | absent | | $\zeta_{4}$ | $\beta_{4}-\gamma$ | absent |
a ^("a "){ }^{\text {a }} Revised parameters are used by Will and Nordtvedt (1972), Nordtvedt and Will (1972), Will (1972), and Ni (1973).
b ^("b "){ }^{\text {b }} Notation of this book is used by Will (1971a,b,c,d), Ni (1972), and Thorne, Ni, and Will (1971).
c ^("c "){ }^{\text {c }} Nordtvedt's original "point-particle" parameters were used by Nordtvedt (1968b, 1970, 1971a,b).

B. Significance of Revised Parameters

α 1 , α 2 , α 3 α 1 , α 2 , α 3 alpha_(1),alpha_(2),alpha_(3)\alpha_{1}, \alpha_{2}, \alpha_{3}α1,α2,α3 measure the extent of and nature of "preferred-frame effects"; see § 39.9 § 39.9 §39.9\S 39.9§39.9. Any theory of gravity with at least one α α alpha\alphaα nonzero is called a preferred-frame theory. ζ 1 , ζ 2 , ζ 3 , ζ 4 , α 3 ζ 1 , ζ 2 , ζ 3 , ζ 4 , α 3 zeta_(1),zeta_(2),zeta_(3),zeta_(4),alpha_(3)\zeta_{1}, \zeta_{2}, \zeta_{3}, \zeta_{4}, \alpha_{3}ζ1,ζ2,ζ3,ζ4,α3 measure the extent of and nature of breakdowns in global conservation laws. A theory of gravity possesses, at the post-Newtonian level, all 10 global conservation laws ( 4 for energy-momentum, 6 for angular momentum; see Chapters 19 and 20) if and only if ζ 1 = ζ 2 = ζ 3 = ζ 4 = α 3 = 0 ζ 1 = ζ 2 = ζ 3 = ζ 4 = α 3 = 0 zeta_(1)=zeta_(2)=zeta_(3)=zeta_(4)=alpha_(3)=0\zeta_{1}=\zeta_{2}=\zeta_{3}=\zeta_{4}=\alpha_{3}=0ζ1=ζ2=ζ3=ζ4=α3=0. See Will (1971d), Will and Nordtvedt (1972), Will (1972), for proofs and discussion. Any theory with ζ 1 = ζ 2 ζ 1 = ζ 2 zeta_(1)=zeta_(2)\zeta_{1}=\zeta_{2}ζ1=ζ2 = ζ 3 = ζ 4 = α 3 = 0 = ζ 3 = ζ 4 = α 3 = 0 =zeta_(3)=zeta_(4)=alpha_(3)=0=\zeta_{3}=\zeta_{4}=\alpha_{3}=0=ζ3=ζ4=α3=0 is called a conservative theory.
In general relativity and the Dicke-Brans-Jordan theory, all α α alpha\alphaα 's and ζ ζ zeta\zetaζ 's vanish. Thus, general relativity and Dicke-Brans-Jordan are conservative theories with no preferred-frame effects.
(a) Show that, when written in the chosen PPN coordinate frame, this expression for M A M A M_(A)M_{A}MA becomes
(39.59c) M A = V A ρ o ( 1 + Π + 1 2 v A 2 + 3 U 1 2 U self ) d 3 x + O ( M A ϵ 4 ) (39.59c) M A = V A ρ o 1 + Π + 1 2 v A 2 + 3 U 1 2 U self  d 3 x + O M A ϵ 4 {:(39.59c)M_(A)=int_(V_(A))rho_(o)(1+Pi+(1)/(2)v_(A)^(2)+3U-(1)/(2)U_("self "))d^(3)x+O(M_(A)epsilon^(4)):}\begin{equation*} M_{A}=\int_{V_{A}} \rho_{o}\left(1+\Pi+\frac{1}{2} v_{A}^{2}+3 U-\frac{1}{2} U_{\text {self }}\right) d^{3} x+O\left(M_{A} \epsilon^{4}\right) \tag{39.59c} \end{equation*}(39.59c)MA=VAρo(1+Π+12vA2+3U12Uself )d3x+O(MAϵ4)
Use equations (39.43), (39.44), and (39.46) to show that M A M A M_(A)M_{A}MA is conserved as the bodies move about, d M A / d t = 0 d M A / d t = 0 dM_(A)//dt=0d M_{A} / d t=0dMA/dt=0.
(b) Pick an event ( t , x ) ( t , x ) (t,x)(t, x)(t,x) outside all the bodies, and at time t t ttt denote
(39.59d) r A x A x , r A B x A x B , r A | r A | , r A B | r A B | (39.59d) r A x A x , r A B x A x B , r A r A , r A B r A B {:(39.59d)r_(A)-=x_(A)-x","quadr_(AB)-=x_(A)-x_(B)","quadr_(A)-=|r_(A)|","quadr_(AB)-=|r_(AB)|:}\begin{equation*} r_{A} \equiv x_{A}-x, \quad r_{A B} \equiv x_{A}-x_{B}, \quad r_{A} \equiv\left|r_{A}\right|, \quad r_{A B} \equiv\left|r_{A B}\right| \tag{39.59d} \end{equation*}(39.59d)rAxAx,rABxAxB,rA|rA|,rAB|rAB|
Show that the general-relativistic, post-Newtonian metric (39.32) at the chosen event has the form
(39.60a) g i k = δ j k ( 1 + 2 A M A r A ) + O ( ϵ 4 ) , (39.60b) g 0 j = A M A r A [ 7 2 v A j + 1 2 ( v A r A ) r A j r A 2 ] + O ( ϵ 5 ) , g 00 = (39.60c) 1 + 2 A M A r A 2 ( A M A r A ) 2 + 3 A M A v A 2 r A 2 A B A M A M B r A r A B + O ( ϵ 6 ) . (39.60a) g i k = δ j k 1 + 2 A M A r A + O ϵ 4 , (39.60b) g 0 j = A M A r A 7 2 v A j + 1 2 v A r A r A j r A 2 + O ϵ 5 , g 00 = (39.60c) 1 + 2 A M A r A 2 A M A r A 2 + 3 A M A v A 2 r A 2 A B A M A M B r A r A B + O ϵ 6 . {:[(39.60a)g_(ik)=delta_(jk)(1+2sum_(A)(M_(A))/(r_(A)))+O(epsilon^(4))","],[(39.60b)g_(0j)=-sum_(A)(M_(A))/(r_(A))[(7)/(2)v_(Aj)+(1)/(2)((v_(A)*r_(A))r_(Aj))/(r_(A)^(2))]+O(epsilon^(5))","],[g_(00)=],[(39.60c)-1+2sum_(A)(M_(A))/(r_(A))-2(sum_(A)(M_(A))/(r_(A)))^(2)+3sum_(A)(M_(A)v_(A)^(2))/(r_(A))],[quad-2sum_(A)sum_(B!=A)(M_(A)M_(B))/(r_(A)r_(AB))+O(epsilon^(6)).]:}\begin{align*} & g_{i k}=\delta_{j k}\left(1+2 \sum_{A} \frac{M_{A}}{r_{A}}\right)+O\left(\epsilon^{4}\right), \tag{39.60a}\\ & g_{0 j}=-\sum_{A} \frac{M_{A}}{r_{A}}\left[\frac{7}{2} v_{A j}+\frac{1}{2} \frac{\left(v_{A} \cdot r_{A}\right) r_{A j}}{r_{A}^{2}}\right]+O\left(\epsilon^{5}\right), \tag{39.60b}\\ & g_{00}= \\ & -1+2 \sum_{A} \frac{M_{A}}{r_{A}}-2\left(\sum_{A} \frac{M_{A}}{r_{A}}\right)^{2}+3 \sum_{A} \frac{M_{A} v_{A}^{2}}{r_{A}} \tag{39.60c}\\ & \quad-2 \sum_{A} \sum_{B \neq A} \frac{M_{A} M_{B}}{r_{A} r_{A B}}+O\left(\epsilon^{6}\right) . \end{align*}(39.60a)gik=δjk(1+2AMArA)+O(ϵ4),(39.60b)g0j=AMArA[72vAj+12(vArA)rAjrA2]+O(ϵ5),g00=(39.60c)1+2AMArA2(AMArA)2+3AMAvA2rA2ABAMAMBrArAB+O(ϵ6).
[Hint: From the Newtonian virial theorem (39.21a), applied to body A A AAA by itself in its own rest frame, conclude that
(39.61) T ~ A ( 3 p 1 2 ρ o U self ) d 3 x = O ( M A ϵ 4 ) , (39.61) T ~ A 3 p 1 2 ρ o U self  d 3 x = O M A ϵ 4 , {:(39.61)int_( widetilde(T)_(A))(3p-(1)/(2)rho_(o)U_("self "))d^(3)x=O(M_(A)epsilon^(4))",":}\begin{equation*} \int_{\widetilde{T}_{A}}\left(3 p-\frac{1}{2} \rho_{o} U_{\text {self }}\right) d^{3} x=O\left(M_{A} \epsilon^{4}\right), \tag{39.61} \end{equation*}(39.61)T~A(3p12ρoUself )d3x=O(MAϵ4),
where the integral is performed in the PPN frame.]
(c) Perform an infinitesimal coordinate transformation,
(39.62) t OLD = t NEW 1 2 A M A ( r A v A ) r A , x OLD = x NEW (39.62) t OLD = t NEW 1 2 A M A r A v A r A , x OLD = x NEW {:(39.62)t_(OLD)=t_(NEW)-(1)/(2)sum_(A)(M_(A)(r_(A)*v_(A)))/(r_(A))","quadx_(OLD)=x_(NEW):}\begin{equation*} t_{\mathrm{OLD}}=t_{\mathrm{NEW}}-\frac{1}{2} \sum_{A} \frac{M_{A}\left(r_{A} \cdot v_{A}\right)}{r_{A}}, \quad x_{\mathrm{OLD}}=x_{\mathrm{NEW}} \tag{39.62} \end{equation*}(39.62)tOLD=tNEW12AMA(rAvA)rA,xOLD=xNEW
to bring the metric (39.60) into the standard form originally devised by Einstein, Infeld, and Hoffman (1938), and by Eddington and Clark (1938):
(39.63a) g j k = δ j k ( 1 + 2 A M A r A ) + O ( ϵ 4 ) , (39.63b) g 0 j = 4 A M A r A v A j + O ( ϵ 5 ) , g 00 = 1 + 2 A M A r A 2 ( A M A r A ) 2 + 3 A M A v A 2 r A (39.63c) 2 A B A M A M B r A r A B 2 χ t 2 + O ( ϵ 6 ) , (39.63a) g j k = δ j k 1 + 2 A M A r A + O ϵ 4 , (39.63b) g 0 j = 4 A M A r A v A j + O ϵ 5 , g 00 = 1 + 2 A M A r A 2 A M A r A 2 + 3 A M A v A 2 r A (39.63c) 2 A B A M A M B r A r A B 2 χ t 2 + O ϵ 6 , {:[(39.63a)g_(jk)=delta_(jk)(1+2sum_(A)(M_(A))/(r_(A)))+O(epsilon^(4))","],[(39.63b)g_(0j)=-4sum_(A)(M_(A))/(r_(A))v_(Aj)+O(epsilon^(5))","],[g_(00)=-1+2sum_(A)(M_(A))/(r_(A))-2(sum_(A)(M_(A))/(r_(A)))^(2)+3sum_(A)(M_(A)v_(A)^(2))/(r_(A))],[(39.63c)quad-2sum_(A)sum_(B!=A)(M_(A)M_(B))/(r_(A)r_(AB))-(del^(2)chi)/(delt^(2))+O(epsilon^(6))","]:}\begin{align*} & g_{j k}= \delta_{j k}\left(1+2 \sum_{A} \frac{M_{A}}{r_{A}}\right)+O\left(\epsilon^{4}\right), \tag{39.63a}\\ & g_{0 j}=-4 \sum_{A} \frac{M_{A}}{r_{A}} v_{A j}+O\left(\epsilon^{5}\right), \tag{39.63b}\\ & g_{00}=-1+2 \sum_{A} \frac{M_{A}}{r_{A}}-2\left(\sum_{A} \frac{M_{A}}{r_{A}}\right)^{2}+3 \sum_{A} \frac{M_{A} v_{A}^{2}}{r_{A}} \\ & \quad-2 \sum_{A} \sum_{B \neq A} \frac{M_{A} M_{B}}{r_{A} r_{A B}}-\frac{\partial^{2} \chi}{\partial t^{2}}+O\left(\epsilon^{6}\right), \tag{39.63c} \end{align*}(39.63a)gjk=δjk(1+2AMArA)+O(ϵ4),(39.63b)g0j=4AMArAvAj+O(ϵ5),g00=1+2AMArA2(AMArA)2+3AMAvA2rA(39.63c)2ABAMAMBrArAB2χt2+O(ϵ6),
where χ χ chi\chiχ [equation (39.49a)] is given by
χ = A M A r A χ = A M A r A chi=-sum_(A)M_(A)r_(A)\chi=-\sum_{A} M_{A} r_{A}χ=AMArA
(d) The equations of motion for the bodies can be obtained in either of two ways: by performing a volume integral of the Euler equation (39.48) over the interior of each body; or by invoking the general arguments of § 20.6 § 20.6 §20.6\S 20.6§20.6. The latter way is the easier. Use it to conclude that any chosen body K K KKK moves along a geodesic of the metric obtained by omitting the terms A = K A = K A=KA=KA=K from the sums in (39.63). Show that the geodesic equation for body K K KKK reduces to
d 2 x K d t 2 d v K d t = A K r A K M A r A K 3 [ 1 4 B K M B r B K C A M C r C A ( 1 r A K r C A 2 r C A 2 ) + v K 2 + 2 v A 2 4 v A v K 3 2 ( v A r A K r A K ) 2 ] (39.64) A K ( v A v K ) M A r A K ( 3 v A 4 v K ) r A K 3 + 7 2 A K C A r C A M A M C r A K r C A 3 d 2 x K d t 2 d v K d t = A K r A K M A r A K 3 1 4 B K M B r B K C A M C r C A 1 r A K r C A 2 r C A 2 + v K 2 + 2 v A 2 4 v A v K 3 2 v A r A K r A K 2 (39.64) A K v A v K M A r A K 3 v A 4 v K r A K 3 + 7 2 A K C A r C A M A M C r A K r C A 3 {:[(d^(2)x_(K))/(dt^(2))-=(dv_(K))/(dt)=sum_(A!=K)r_(AK)(M_(A))/(r_(AK)^(3))[1-4sum_(B!=K)(M_(B))/(r_(BK))-sum_(C!=A)(M_(C))/(r_(CA))(1-(r_(AK)*r_(CA))/(2r_(CA)^(2))):}],[{:+v_(K)^(2)+2v_(A)^(2)-4v_(A)*v_(K)-(3)/(2)((v_(A)*r_(AK))/(r_(AK)))^(2)]],[(39.64)-sum_(A!=K)(v_(A)-v_(K))(M_(A)r_(AK)*(3v_(A)-4v_(K)))/(r_(AK)^(3))],[+(7)/(2)sum_(A!=K)sum_(C!=A)r_(CA)(M_(A)M_(C))/(r_(AK)r_(CA)^(3))]:}\begin{align*} \frac{d^{2} \boldsymbol{x}_{K}}{d t^{2}} \equiv \frac{d v_{K}}{d t}= & \sum_{A \neq K} r_{A K} \frac{M_{A}}{r_{A K}{ }^{3}}\left[1-4 \sum_{B \neq K} \frac{M_{B}}{r_{B K}}-\sum_{C \neq A} \frac{M_{C}}{r_{C A}}\left(1-\frac{r_{A K} \cdot r_{C A}}{2 r_{C A}{ }^{2}}\right)\right. \\ & \left.+v_{K}{ }^{2}+2 v_{A}{ }^{2}-4 v_{A} \cdot v_{K}-\frac{3}{2}\left(\frac{v_{A} \cdot r_{A K}}{r_{A K}}\right)^{2}\right] \\ - & \sum_{A \neq K}\left(v_{A}-v_{K}\right) \frac{M_{A} r_{A K} \cdot\left(3 v_{A}-4 v_{K}\right)}{r_{A K}{ }^{3}} \tag{39.64}\\ + & \frac{7}{2} \sum_{A \neq K} \sum_{C \neq A} r_{C A} \frac{M_{A} M_{C}}{r_{A K} r_{C A}{ }^{3}} \end{align*}d2xKdt2dvKdt=AKrAKMArAK3[14BKMBrBKCAMCrCA(1rAKrCA2rCA2)+vK2+2vA24vAvK32(vArAKrAK)2](39.64)AK(vAvK)MArAK(3vA4vK)rAK3+72AKCArCAMAMCrAKrCA3
Equations (39.63) and (39.64) are called the Einstein-Infeld-Hoffman ("EIH") equations for the geometry and evolution of a many-body system. They are used widely in analyses of planetary orbits in the solar system. For example, the Caltech Jet Propulsion Laboratory uses them, in modified form, to calculate ephemerides for high-precision tracking of planets and spacecraft. The above method of deriving the EIH equations and metric was devised by Fock (1959). For a similar calculation in the Dicke-Brans-Jordan theory, see Estabrook (1969); and for a derivation of the analogous many-body equations in the full PPN formalism, see Will (1972).

сниетеп 40

SOLAR-SYSTEM EXPERIMENTS

This chapter analyzes experiments using PPN formalism
Complexity of solar system's spacetime geometry

§40.1. MANY EXPERIMENTS OPEN TO DISTINGUISH GENERAL RELATIVITY FROM PROPOSED METRIC THEORIES OF GRAVITY

No audience will show up for a fight if in everyone's eyes the challenger has zero chance to win. No battle-hungry promoter desperately trying to finance the fight can afford to put into the ring against the champion any but the best contender that he can find. Against Einstein's metric theory of gravity, the judgment of the day (as $ 39.2 $ 39.2 $39.2\$ 39.2$39.2 showed) leaves one no option except to put up another theory of gravity that is also metric (or metric plus torsion).
To put on a contest, then, is to design and perform an experiment that distinguishes general relativity from some not completely implausible metric theory of gravity. This chapter describes such experiments-some already performed; some to be performed in the future-and analyses their significance using the PPN formalism of Chapter 39.
In most of the experiments to be described, one investigates the motion of the moon, planets, spacecraft, light rays, or gyroscopes through the spacetime geometry of the solar system. That spacetime geometry is very complicated. It includes the spherical fields of the sun and all the planets, nonspherical fields due to their quadrupolar and higher-order deformations, and fields due to their momentum and angular momentum. Moreover, the spacetime geometry results-or at least in the post-Newtonian formalism it is viewed as resulting-from a nonlinear superposition of all these fields.*
Fortunately for this discussion, several of the most important experiments are free of almost all these complications. The effects they measure are associated entirely with the spherical part of the sun's gravitational field. A description of these experiments will come first ( $ § 40.2 40.5 $ § 40.2 40.5 $§40.2-40.5\$ \S 40.2-40.5$§40.240.5 ), and then attention will turn to experiments that are more complex in principle.
To discuss central-field experiments, one needs an expression for the external gravitational field of an idealized, isolated, static, spherical sun. In general relativity, such a gravitational field is described by the Schwarzschild line element,
d s 2 = ( 1 2 M r ) d t 2 + d r 2 1 2 M / r + r 2 ( d θ 2 + sin 2 ϕ d ϕ 2 ) d s 2 = 1 2 M r d t 2 + d r 2 1 2 M / r + r 2 d θ 2 + sin 2 ϕ d ϕ 2 ds^(2)=-(1-(2M_(o.))/(r))dt^(2)+(dr^(2))/(1-2M_(o.)//r)+r^(2)(dtheta^(2)+sin^(2)phi dphi^(2))d s^{2}=-\left(1-\frac{2 M_{\odot}}{r}\right) d t^{2}+\frac{d r^{2}}{1-2 M_{\odot} / r}+r^{2}\left(d \theta^{2}+\sin ^{2} \phi d \phi^{2}\right)ds2=(12Mr)dt2+dr212M/r+r2(dθ2+sin2ϕdϕ2)
But this line element is not what one wants, for two reasons: (1) it is "too accurate"; (2) it is written in the "wrong" coordinate system.
Why too accurate? Because it is simple only when unperturbed and unmodified; whereas some modified theories show up new effects that are so complex they are tractable only in the post-Newtonian approximation. Why wrong coordinate system? Because physicists, astronomers, and other celestial mechanics have adopted the fairly standard convention of using "isotropic coordinates" rather than "Schwarzschild coordinates" when analyzing the solar system. Example: post-Newtonian expansions, including the PPN formalism of Chapter 39, almost always use isotropic coordinates. Another example: the relativistic ephemeris for the solar system, prepared by the Caltech Jet Propulsion Laboratory [Ohandley et al. (1969); Anderson (1973)] and used extensively throughout the world, employs isotropic coordinates.
Modify the Schwarzschild line element, then. First transform to isotropic coordinates (Exercise 31.7); then expand the metric coefficients in powers of M / r M / r M_(o.)//rM_{\odot} / rM/r, to post-Newtonian accuracy. Thereby obtain
d s 2 = [ 1 2 M r + 2 ( M r ) 2 ] d t 2 + [ 1 + 2 M r ] [ d r 2 + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) ] (40.1) = [ 1 2 M r + 2 ( M r ) 2 ] d t 2 + [ 1 + 2 M r ] [ d x 2 + d y 2 + d z 2 ] . d s 2 = 1 2 M r + 2 M r 2 d t 2 + 1 + 2 M r d r 2 + r 2 d θ 2 + sin 2 θ d ϕ 2 (40.1) = 1 2 M r + 2 M r 2 d t 2 + 1 + 2 M r d x 2 + d y 2 + d z 2 . {:[ds^(2)=-[1-2(M_(o.))/(r)+2((M_(o.))/(r))^(2)]dt^(2)+[1+2(M_(o.))/(r)][dr^(2)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))]],[(40.1)=-[1-2(M_(o.))/(r)+2((M_(o.))/(r))^(2)]dt^(2)+[1+2(M_(o.))/(r)][dx^(2)+dy^(2)+dz^(2)].]:}\begin{align*} d s^{2} & =-\left[1-2 \frac{M_{\odot}}{r}+2\left(\frac{M_{\odot}}{r}\right)^{2}\right] d t^{2}+\left[1+2 \frac{M_{\odot}}{r}\right]\left[d r^{2}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right)\right] \\ & =-\left[1-2 \frac{M_{\odot}}{r}+2\left(\frac{M_{\odot}}{r}\right)^{2}\right] d t^{2}+\left[1+2 \frac{M_{\odot}}{r}\right]\left[d x^{2}+d y^{2}+d z^{2}\right] . \tag{40.1} \end{align*}ds2=[12Mr+2(Mr)2]dt2+[1+2Mr][dr2+r2(dθ2+sin2θdϕ2)](40.1)=[12Mr+2(Mr)2]dt2+[1+2Mr][dx2+dy2+dz2].
Here r , θ , ϕ r , θ , ϕ r,theta,phir, \theta, \phir,θ,ϕ are related to x , y , z x , y , z x,y,zx, y, zx,y,z in the usual manner:
(40.2) r = ( x 2 + y 2 + z 2 ) 1 / 2 , θ = tan 1 [ z / ( x 2 + y 2 ) 1 / 2 ] , ϕ = tan 1 ( y / x ) ; (40.2) r = x 2 + y 2 + z 2 1 / 2 , θ = tan 1 z / x 2 + y 2 1 / 2 , ϕ = tan 1 ( y / x ) ; {:(40.2)r=(x^(2)+y^(2)+z^(2))^(1//2)","quad theta=tan^(-1)[z//(x^(2)+y^(2))^(1//2)]","quad phi=tan^(-1)(y//x);:}\begin{equation*} r=\left(x^{2}+y^{2}+z^{2}\right)^{1 / 2}, \quad \theta=\tan ^{-1}\left[z /\left(x^{2}+y^{2}\right)^{1 / 2}\right], \quad \phi=\tan ^{-1}(y / x) ; \tag{40.2} \end{equation*}(40.2)r=(x2+y2+z2)1/2,θ=tan1[z/(x2+y2)1/2],ϕ=tan1(y/x);
and r r rrr is the new, "isotropic" radial coordinate, not to be confused with the Schwarzschild r r rrr. (The reader who has not studied $ 39.6 $ 39.6 $39.6\$ 39.6$39.6 will discover in the next section why one keeps terms of order M 2 / r 2 M 2 / r 2 M_(o.)^(2)//r^(2)M_{\odot}{ }^{2} / r^{2}M2/r2 in g 00 g 00 g_(00)g_{00}g00 but not in g j k g j k g_(jk)g_{j k}gjk.) Note: this postNewtonian expression for the metric is a special case of the result derived in exercise 19.3.
If one calculates the gravitational field of the same source (the sun) in the same post-Newtonian approximation in other metric theories of gravity, one obtains a very similar result:
Idealization of geometry to that of isolated, static, spherical sun:
(1) in Schwarzschild coordinates
(2) in isotropic coordinates
(3) in PPN formalism
(4) including preferred-frame effects
d s 2 = [ 1 2 M r + 2 β ( M r ) 2 ] d t 2 + [ 1 + 2 γ M r ] [ d r 2 + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) ] (40.3) = [ 1 2 M r + 2 β ( M r ) 2 ] d t 2 + [ 1 + 2 γ M r ] [ d x 2 + d y 2 + d z 2 ] d s 2 = 1 2 M r + 2 β M r 2 d t 2 + 1 + 2 γ M r d r 2 + r 2 d θ 2 + sin 2 θ d ϕ 2 (40.3) = 1 2 M r + 2 β M r 2 d t 2 + 1 + 2 γ M r d x 2 + d y 2 + d z 2 {:[ds^(2)=-[1-2(M_(o.))/(r)+2beta((M_(o.))/(r))^(2)]dt^(2)+[1+2gamma(M_(o.))/(r)][dr^(2)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))]],[(40.3)=-[1-2(M_(o.))/(r)+2beta((M_(o.))/(r))^(2)]dt^(2)+[1+2gamma(M_(o.))/(r)][dx^(2)+dy^(2)+dz^(2)]]:}\begin{align*} d s^{2} & =-\left[1-2 \frac{M_{\odot}}{r}+2 \beta\left(\frac{M_{\odot}}{r}\right)^{2}\right] d t^{2}+\left[1+2 \gamma \frac{M_{\odot}}{r}\right]\left[d r^{2}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right)\right] \\ & =-\left[1-2 \frac{M_{\odot}}{r}+2 \beta\left(\frac{M_{\odot}}{r}\right)^{2}\right] d t^{2}+\left[1+2 \gamma \frac{M_{\odot}}{r}\right]\left[d x^{2}+d y^{2}+d z^{2}\right] \tag{40.3} \end{align*}ds2=[12Mr+2β(Mr)2]dt2+[1+2γMr][dr2+r2(dθ2+sin2θdϕ2)](40.3)=[12Mr+2β(Mr)2]dt2+[1+2γMr][dx2+dy2+dz2]
(see exercise 40.1). Here γ γ gamma\gammaγ and β β beta\betaβ are two of the ten PPN parameters described in Box 39.2. Recall from that box that γ γ gamma\gammaγ measures "the amount of space curvature produced by unit rest mass," while β β beta\betaβ measures "the amount of nonlinearity in the superposition law for g 00 g 00 g_(00)g_{00}g00." These heuristic descriptions find their mathematical counterparts in the above form for the idealized metric surrounding a spherically symmetric center of attraction.
By measuring the parameter γ γ gamma\gammaγ to high precision, one can distinguish between general relativity ( γ = 1 ) ( γ = 1 ) (gamma=1)(\gamma=1)(γ=1) and the Dicke-Brans-Jordan theory [ γ = ( 1 + ω ) / ( 2 + ω ) [ γ = ( 1 + ω ) / ( 2 + ω ) [gamma=(1+omega)//(2+omega)[\gamma=(1+\omega) /(2+\omega)[γ=(1+ω)/(2+ω), where ω ω omega\omegaω is the "Dicke coupling constant"]; see Box 39.2. But general relativity and Dicke-Brans-Jordan predict the same value for β ( β = 1 ) β ( β = 1 ) beta(beta=1)\beta(\beta=1)β(β=1). This identity does not mean that β β beta\betaβ is unworthy of measurement. A value β 1 β 1 beta!=1\beta \neq 1β1 is predicted by other theories [see Ni (1972)]; so measurements of β β beta\betaβ are useful in distinguishing such theories from general relativity.
Actually, the above form (40.3) for the sun's metric is not fully general. In any theory with a preferred "universal rest frame" (e.g., Ni's theory; Box 39.1), there are additional terms in the metric due to motion of the sun relative to that preferred frame (exercise 40.1):
d s 2 = ( expression 40.3 ) + ( α 2 + α 3 α 1 ) M r w 2 d t 2 + 2 ( α 2 1 2 α 1 ) M r w j d x j d t ( ) α 2 [ M r 3 x j x k I r 5 ( x j x k 1 3 r 2 δ j k ) ] w j d t ( 2 d x k + w k d t ) . d s 2 = (  expression  40.3 ) + α 2 + α 3 α 1 M r w 2 d t 2 + 2 α 2 1 2 α 1 M r w j d x j d t ( ) α 2 M r 3 x j x k I r 5 x j x k 1 3 r 2 δ j k w j d t 2 d x k + w k d t . {:[ds^(2)=(" expression "40.3)+(alpha_(2)+alpha_(3)-alpha_(1))(M_(o.))/(r)w^(2)dt^(2)+2(alpha_(2)-(1)/(2)alpha_(1))(M_(o.))/(r)w_(j)dx^(j)dt],[('")"-alpha_(2)[(M_(o.))/(r^(3))x^(j)x^(k)-(I_(o.))/(r^(5))(x^(j)x^(k)-(1)/(3)r^(2)delta_(jk))]w_(j)dt(2dx^(k)+w_(k)dt).]:}\begin{align*} d s^{2}= & (\text { expression } 40.3)+\left(\alpha_{2}+\alpha_{3}-\alpha_{1}\right) \frac{M_{\odot}}{r} w^{2} d t^{2}+2\left(\alpha_{2}-\frac{1}{2} \alpha_{1}\right) \frac{M_{\odot}}{r} w_{j} d x^{j} d t \\ & -\alpha_{2}\left[\frac{M_{\odot}}{r^{3}} x^{j} x^{k}-\frac{I_{\odot}}{r^{5}}\left(x^{j} x^{k}-\frac{1}{3} r^{2} \delta_{j k}\right)\right] w_{j} d t\left(2 d x^{k}+w_{k} d t\right) . \tag{$\prime$} \end{align*}ds2=( expression 40.3)+(α2+α3α1)Mrw2dt2+2(α212α1)Mrwjdxjdt()α2[Mr3xjxkIr5(xjxk13r2δjk)]wjdt(2dxk+wkdt).
In these "preferred-frame terms" I I j j = ρ r 2 d 3 x I I j j = ρ r 2 d 3 x I_(o.)-=I_(jj)=int rhor^(2)d^(3)xI_{\odot} \equiv I_{j j}=\int \rho r^{2} d^{3} xIIjj=ρr2d3x is the trace of the second moment of the sun's mass distribution;
α 1 = 7 Δ 1 + Δ 2 4 γ 4 , α 2 = Δ 2 + ζ 1 , α 3 = 4 β 1 2 γ 2 ζ α 1 = 7 Δ 1 + Δ 2 4 γ 4 , α 2 = Δ 2 + ζ 1 , α 3 = 4 β 1 2 γ 2 ζ {:[alpha_(1)=7Delta_(1)+Delta_(2)-4gamma-4","],[alpha_(2)=Delta_(2)+zeta-1","],[alpha_(3)=4beta_(1)-2gamma-2-zeta]:}\begin{aligned} & \alpha_{1}=7 \Delta_{1}+\Delta_{2}-4 \gamma-4, \\ & \alpha_{2}=\Delta_{2}+\zeta-1, \\ & \alpha_{3}=4 \beta_{1}-2 \gamma-2-\zeta \end{aligned}α1=7Δ1+Δ24γ4,α2=Δ2+ζ1,α3=4β12γ2ζ
are combinations of PPN parameters; and w w w\boldsymbol{w}w is the sun's velocity ( -=\equiv velocity of coordinate system) relative to the preferred frame. (Theories such as general relativity and Dicke-Brans-Jordan, which possess no preferred frame, have α 1 = α 2 = α 3 = 0 α 1 = α 2 = α 3 = 0 alpha_(1)=alpha_(2)=alpha_(3)=0\alpha_{1}=\alpha_{2}=\alpha_{3}=0α1=α2=α3=0, and therefore have no preferred-frame terms in the metric.) For ease of exposition, all equations and calculations in this chapter will ignore the preferred-frame terms; but the consequences of those terms will be discussed and references analyzing them will be cited.

Exercise 40.1. PPN METRIC FOR IDEALIZED SUN [Track 2]

Show that for an isolated, static, spherical sun at rest at the origin of the PPN coordinate system, the PPN metric (39.32) reduces to expressions (40.3), (40.3'). As part of the reduction, show that the sun's total mass-energy is given by
(40.4) M = 0 R ρ 0 ( 1 + 2 β 2 U + β 3 Π + 3 β 4 p / ρ 0 ) 4 π r 2 d r (40.4) M = 0 R ρ 0 1 + 2 β 2 U + β 3 Π + 3 β 4 p / ρ 0 4 π r 2 d r {:(40.4)M_(o.)=int_(0)^(R_(o.))rho_(0)(1+2beta_(2)U+beta_(3)Pi+3beta_(4)p//rho_(0))4pir^(2)dr:}\begin{equation*} M_{\odot}=\int_{0}^{R_{\odot}} \rho_{0}\left(1+2 \beta_{2} U+\beta_{3} \Pi+3 \beta_{4} p / \rho_{0}\right) 4 \pi r^{2} d r \tag{40.4} \end{equation*}(40.4)M=0Rρ0(1+2β2U+β3Π+3β4p/ρ0)4πr2dr
[Warning: One must not look at this formula and immediately think: "The contribution of rest mass is ρ 0 4 π r 2 d r ρ 0 4 π r 2 d r intrho_(0)4pir^(2)dr\int \rho_{0} 4 \pi r^{2} d rρ04πr2dr, the contribution of gravitational energy is 2 β 2 ρ 0 U 4 π r 2 d r 2 β 2 ρ 0 U 4 π r 2 d r int2beta_(2)rho_(0)U4pir^(2)dr\int 2 \beta_{2} \rho_{0} U 4 \pi r^{2} d r2β2ρ0U4πr2dr, etc." Rather, in making any such interpretation one must remember that (1) spacetime is curved, so 4 π r 2 d r 4 π r 2 d r 4pir^(2)dr4 \pi r^{2} d r4πr2dr is not proper volume as measured by physical meter sticks; also (2) virial theorems (exercise 39.6) and other integral theorems can be used to change the form of the integrand. For further discussion see exercises 40.9 and 40.10 below.]
For further discussion see exercises 40.9 and 40.10 below.]  For further discussion see exercises  40.9  and  40.10  below.]  " For further discussion see exercises "40.9" and "40.10" below.] "\text { For further discussion see exercises } 40.9 \text { and } 40.10 \text { below.] } For further discussion see exercises 40.9 and 40.10 below.] 

EXERCISES

§40.2. THE USE OF LIGHT RAYS AND RADIO WAVES TO TEST GRAVITY

In the Newtonian limit, planetary and spacecraft orbits are strongly influenced by gravity; but light propagation and radio-wave propagation (at "infinite" velocity) are not influenced at all. For this reason, experimental studies of orbits are beset by the problem of separating the relativistic effects from much larger standard Newtonian effects. By contrast, experimental studies of light and radio-wave propagation do not contend with any such overpowering Newtonian background. Not surprisingly, they are to date (1973) the clearest and most definitive of the solarsystem experiments.
Mathematically, the parameter that distinguishes a light ray from a planet is its high speed. In the geodesic equation, the magnitude of the velocity determines which metric coefficients can influence the motion. Consider, for example, a weak, static field g α β = η α β + h α β g α β = η α β + h α β g_(alpha beta)=eta_(alpha beta)+h_(alpha beta)g_{\alpha \beta}=\eta_{\alpha \beta}+h_{\alpha \beta}gαβ=ηαβ+hαβ, and a particle at ( x , y , z ) = ( r , 0 , 0 ) ( x , y , z ) = ( r , 0 , 0 ) (x,y,z)=(r,0,0)(x, y, z)=(r, 0,0)(x,y,z)=(r,0,0) moving with velocity ( v x , v y , v z ) = ( 0 , v , 0 ) v x , v y , v z = ( 0 , v , 0 ) (v_(x),v_(y),v_(z))=(0,v,0)\left(v_{x}, v_{y}, v_{z}\right)=(0, v, 0)(vx,vy,vz)=(0,v,0); see Figure 40.1. Here the effect of gravity on the trajectory of the particle can be characterized by the quantity
( curvature of trajectory in 3-dimensional, nearly Euclidean, space ) = ( radius of curvature of trajectory ) 1 (  curvature of trajectory in 3-dimensional,   nearly Euclidean, space  ) = (  radius of curvature   of trajectory  ) 1 ((" curvature of trajectory in 3-dimensional, ")/(" nearly Euclidean, space "))=((" radius of curvature ")/(" of trajectory "))^(-1)\binom{\text { curvature of trajectory in 3-dimensional, }}{\text { nearly Euclidean, space }}=\binom{\text { radius of curvature }}{\text { of trajectory }}^{-1}( curvature of trajectory in 3-dimensional,  nearly Euclidean, space )=( radius of curvature  of trajectory )1
= d 2 x d y 2 = d τ d y d d τ ( d τ d y d x d τ ) = 1 u y d d τ ( u x u y ) = 1 ( u y ) 2 d u x d τ = ( 1 v 2 ) v 2 Γ x α β d x α d τ d x β d τ = 1 v 2 Γ x α β d x α d t d x β d t = Γ x 00 v 2 2 Γ x 0 y v 1 Γ x y y = 1 2 h 00 , x v 2 + ( h 0 y , x h 0 x , y ) v 1 + ( 1 2 h y y , x h x y , y ) = d 2 x d y 2 = d τ d y d d τ d τ d y d x d τ = 1 u y d d τ u x u y = 1 u y 2 d u x d τ = 1 v 2 v 2 Γ x α β d x α d τ d x β d τ = 1 v 2 Γ x α β d x α d t d x β d t = Γ x 00 v 2 2 Γ x 0 y v 1 Γ x y y = 1 2 h 00 , x v 2 + h 0 y , x h 0 x , y v 1 + 1 2 h y y , x h x y , y {:[=(d^(2)x)/(dy^(2))=(d tau)/(dy)(d)/(d tau)((d tau)/(dy)(dx)/(d tau))=(1)/(u^(y))(d)/(d tau)((u^(x))/(u^(y)))=(1)/((u^(y))^(2))(du^(x))/(d tau)],[=-((1-v^(2)))/(v^(2))Gamma^(x)_(alpha beta)(dx^(alpha))/(d tau)(dx^(beta))/(d tau)=-(1)/(v^(2))Gamma^(x)_(alpha beta)(dx^(alpha))/(dt)(dx^(beta))/(dt)],[=-Gamma^(x)_(00)v^(-2)-2Gamma^(x)_(0y)v^(-1)-Gamma^(x)_(yy)],[=(1)/(2)h_(00,x)v^(-2)+(h_(0y,x)-h_(0x,y))v^(-1)+((1)/(2)h_(yy,x)-h_(xy,y))]:}\begin{aligned} & =\frac{d^{2} x}{d y^{2}}=\frac{d \tau}{d y} \frac{d}{d \tau}\left(\frac{d \tau}{d y} \frac{d x}{d \tau}\right)=\frac{1}{u^{y}} \frac{d}{d \tau}\left(\frac{u^{x}}{u^{y}}\right)=\frac{1}{\left(u^{y}\right)^{2}} \frac{d u^{x}}{d \tau} \\ & =-\frac{\left(1-v^{2}\right)}{v^{2}} \Gamma^{x}{ }_{\alpha \beta} \frac{d x^{\alpha}}{d \tau} \frac{d x^{\beta}}{d \tau}=-\frac{1}{v^{2}} \Gamma^{x}{ }_{\alpha \beta} \frac{d x^{\alpha}}{d t} \frac{d x^{\beta}}{d t} \\ & =-\Gamma^{x}{ }_{00} v^{-2}-2 \Gamma^{x}{ }_{0 y} v^{-1}-\Gamma^{x}{ }_{y y} \\ & =\frac{1}{2} h_{00, x} v^{-2}+\left(h_{0 y, x}-h_{0 x, y}\right) v^{-1}+\left(\frac{1}{2} h_{y y, x}-h_{x y, y}\right) \end{aligned}=d2xdy2=dτdyddτ(dτdydxdτ)=1uyddτ(uxuy)=1(uy)2duxdτ=(1v2)v2Γxαβdxαdτdxβdτ=1v2Γxαβdxαdtdxβdt=Γx00v22Γx0yv1Γxyy=12h00,xv2+(h0y,xh0x,y)v1+(12hyy,xhxy,y)
Light rays and radio waves give "clean" tests of relativity
Figure 40.1.
The bending of the trajectory of a test body at its point of closest approach to the sun, as a function of its 3 -velocity. (See text for computation and discussion.)
Reexpressed in spherical coordinates, in the terminology of the idealized solar line element (40.3), this formula says
( curvature of trajectory in 3-space = 1 2 h 00 , r v 2 + 1 2 ( h ϕ ϕ / r 2 ) , r ( M / r 2 ) ( v 2 + γ )  curvature of trajectory   in 3-space  = 1 2 h 00 , r v 2 + 1 2 h ϕ ϕ / r 2 , r M / r 2 v 2 + γ {:[([" curvature of trajectory "],[" in 3-space "]:}=(1)/(2)h_(00,r)v^(-2)+(1)/(2)(h_(phi phi)//r^(2))_(,r)],[~~-(M_(o.)//r^(2))(v^(-2)+gamma)]:}\begin{align*} \left(\begin{array}{l} \text { curvature of trajectory } \\ \text { in 3-space } \end{array}\right. & =\frac{1}{2} h_{00, r} v^{-2}+\frac{1}{2}\left(h_{\phi \phi} / r^{2}\right)_{, r} \\ & \approx-\left(M_{\odot} / r^{2}\right)\left(v^{-2}+\gamma\right) \end{align*}( curvature of trajectory  in 3-space =12h00,rv2+12(hϕϕ/r2),r(M/r2)(v2+γ)
for a particle at its point of closest approach to the sun. (Compare with exercise 25.21.) Note that here γ γ gamma\gammaγ is a PPN parameter; it is not ( 1 v 2 ) 1 / 2 1 v 2 1 / 2 (1-v^(2))^(-1//2)\left(1-v^{2}\right)^{-1 / 2}(1v2)1/2.
Notice what happens as one boosts the velocity of the particle. For slow velocities [ v 2 ( v 2 v^(2)∼(:}v^{2} \sim\left(\right.v2( post-Newtonian expansion parameter ϵ 2 ) M / R ] ϵ 2 M / R {:epsilon^(2))~~M_(o.)//R_(o.)]\left.\left.\epsilon^{2}\right) \approx M_{\odot} / R_{\odot}\right]ϵ2)M/R], the Newtonian part of h 00 h 00 h_(00)h_{00}h00 dominates completely; and the tiny post-Newtonian corrections come equally from the ϵ 4 ϵ 4 epsilon^(4)\epsilon^{4}ϵ4 part of h 00 h 00 h_(00)h_{00}h00, the ϵ 3 ϵ 3 epsilon^(3)\epsilon^{3}ϵ3 part of h 0 j h 0 j h_(0j)h_{0 j}h0j, and the ϵ 2 ϵ 2 epsilon^(2)\epsilon^{2}ϵ2 part of h j k h j k h_(jk)h_{j k}hjk. [This was the justification for expanding h 00 h 00 h_(00)h_{00}h00 to O ( ϵ 4 ) , h 0 j O ϵ 4 , h 0 j O(epsilon^(4)),h_(0j)O\left(\epsilon^{4}\right), h_{0 j}O(ϵ4),h0j to O ( ϵ 3 ) O ϵ 3 O(epsilon^(3))O\left(\epsilon^{3}\right)O(ϵ3), and h j k h j k h_(jk)h_{j k}hjk to O ( ϵ 2 ) O ϵ 2 O(epsilon^(2))O\left(\epsilon^{2}\right)O(ϵ2) in the postNewtonian limit; see §39.6.] But as v v vvv increases, the ordering of the terms changes. In the high-v regime ( v 1 ϵ 2 ) v 1 ϵ 2 (v∼1≫epsilon^(2))\left(v \sim 1 \gg \epsilon^{2}\right)(v1ϵ2), the bending of the trajectory has become almost imperceptible because of the high forward momentum of the particle and the short time it receives transverse momentum from the sun. What bending is left is due to the ϵ 2 ϵ 2 epsilon^(2)\epsilon^{2}ϵ2 (Newtonian) part of h 00 h 00 h_(00)h_{00}h00, and the ϵ 2 ϵ 2 epsilon^(2)\epsilon^{2}ϵ2 (post-Newtonian) part of h j k h j k h_(jk)h_{j k}hjk. Nothing else can have a significant influence. Notice, moreover, that-even when one allows for "preferred-frame" effects-these dominant terms,
h 00 = 2 U = 2 M / r and h j k = 2 γ U δ j k = 2 γ ( M / r ) δ j k , h 00 = 2 U = 2 M / r  and  h j k = 2 γ U δ j k = 2 γ M / r δ j k , h_(00)=2U=2M_(o.)//r" and "h_(jk)=2gamma Udelta_(jk)=2gamma(M_(o.)//r)delta_(jk),h_{00}=2 U=2 M_{\odot} / r \text { and } h_{j k}=2 \gamma U \delta_{j k}=2 \gamma\left(M_{\odot} / r\right) \delta_{j k},h00=2U=2M/r and hjk=2γUδjk=2γ(M/r)δjk,
depend only on the Newtonian potential U Φ U Φ U-=-PhiU \equiv-\PhiUΦ and the PPN parameter γ γ gamma\gammaγ.
This is a special case of a more general result: Aside from fractional corrections of ϵ 2 10 6 ϵ 2 10 6 epsilon^(2) <= 10^(-6)\epsilon^{2} \leq 10^{-6}ϵ2106, relativistic effects on light and radio-wave propagation are governed entirely by the Newtonian potential U U UUU and the PPN parameter γ γ gamma\gammaγ. These relativistic effects include the gravitational redshift (discussed in the last chapter; independent
Light rays are governed solely by Newtonian potential and PPN parameter γ γ gamma\gammaγ

of γ γ gamma\gammaγ ), the gravitational deflection of light and radio waves (discussed below; dependent on γ γ gamma\gammaγ ), and the "relativistic time-delay" (discussed below; dependent on γ γ gamma\gammaγ ).

§40.3. '"LIGHT"' DEFLECTION

Consider a light or radio ray coming into a telescope on Earth from a distant star or quasar. Do not assume, as in the usual discussion (exercises 18.6 and 25.24), that the ray passes near the sun. The deflection by the sun's gravitational field will probably be measurable, in the middle or late 1970 's, even when the ray passes far from the sun! [The calculation that follows is due to Ward (1970), but Shapiro (1967) first derived the answer.]
Orient the PPN spherical coordinates of equation (40.3) so that the ray lies in the "plane" θ = π / 2 θ = π / 2 theta=pi//2\theta=\pi / 2θ=π/2. By symmetry, if it starts out in this plane far from the Earth, it must lie in this plane always. Let the incoming ray enter the solar system along the line ϕ = 0 ϕ = 0 phi=0\phi=0ϕ=0; and let the Earth be located at r = r E , ϕ = ϕ E r = r E , ϕ = ϕ E r=r_(E),phi=phi_(E)r=r_{E}, \phi=\phi_{E}r=rE,ϕ=ϕE when the ray reaches it. (See Figure 40.2.) One wishes to calculate the angle α α alpha\alphaα between the incoming light ray and the center of the sun, as measured in the orthonormal frame ( e r ^ , e ϕ ^ ) e r ^ , e ϕ ^ (e_( hat(r)),e_( hat(phi)))\left(\boldsymbol{e}_{\hat{r}}, \boldsymbol{e}_{\hat{\phi}}\right)(er^,eϕ^) of an observer on Earth. If the sun had zero mass (flat, Euclidean space), α α alpha\alphaα would be π ϕ E π ϕ E pi-phi_(E)\pi-\phi_{E}πϕE (see Figure 40.2). However, the sun produces a deflection: α = π ϕ E α = π ϕ E alpha=pi-phi_(E)\alpha=\pi-\phi_{E}α=πϕE + δ α + δ α +delta alpha+\delta \alpha+δα. The deflection angle δ α δ α delta alpha\delta \alphaδα is the true objective of the calculation.
In the calculation, ignore the Earth's orbital and rotational motions. They lead to aberration, for which correction can be made by the usual formula of special relativity (Lorentz transformation in the neighborhood of the telescope.) Also ignore deflection of the light ray due to the Earth's gravitational field (deflection angle ~ 2 M E / R E 0 .0003 2 M E / R E 0 .0003 2M_(E)//R_(E)∼0^('').00032 M_{E} / R_{E} \sim 0^{\prime \prime} .00032ME/RE0.0003 ), which might be detectable in the late 1970's.
Figure 40.2.
Coordinates used in the text for calculating the deflection of light. Notice that in this diagram ϕ ϕ phi\phiϕ increases in the clockwise direction.
As the first step in calculating the deflection angle, determine the trajectory of the ray in the r , ϕ r , ϕ r,phir, \phir,ϕ-plane. This can be calculated either using the geodesic equation, or using the eikonal method of geometric optics (Hamilton-Jacobi method; §22.5 and Box 25.4). The result of such a calculation (exercise 40.2) is an equation connecting r r rrr with ϕ ϕ phi\phiϕ; thus,
(40.6) b r = sin ϕ + ( 1 + γ ) M b ( 1 cos ϕ ) (40.6) b r = sin ϕ + ( 1 + γ ) M b ( 1 cos ϕ ) {:(40.6)(b)/(r)=sin phi+((1+gamma)M_(o.))/(b)(1-cos phi):}\begin{equation*} \frac{b}{r}=\sin \phi+\frac{(1+\gamma) M_{\odot}}{b}(1-\cos \phi) \tag{40.6} \end{equation*}(40.6)br=sinϕ+(1+γ)Mb(1cosϕ)
Notice that b b bbb has a simple geometric interpretation: far from the sun, the ray trajectory is ϕ = b / r + O ( M b / r 2 ) ϕ = b / r + O M b / r 2 phi=b//r+O(M_(o.)b//r^(2))\phi=b / r+O\left(M_{\odot} b / r^{2}\right)ϕ=b/r+O(Mb/r2). Consequently, b b bbb is the impact parameter in the usual sense of classical scattering theory (see Figure 40.2). The ray makes its closest approach to the sun (assuming it is not intercepted by the Earth first) at the PPN coordinate radius
(40.7) r min = b [ 1 ( 1 + γ ) M b ] b (40.7) r min = b 1 ( 1 + γ ) M b b {:(40.7)r_(min)=b[1-((1+gamma)M_(o.))/(b)]~~b:}\begin{equation*} r_{\min }=b\left[1-\frac{(1+\gamma) M_{\odot}}{b}\right] \approx b \tag{40.7} \end{equation*}(40.7)rmin=b[1(1+γ)Mb]b
Thus, b b bbb can also be thought of as the radius of the ray's "perihelion."
Notice that the ray returns to r = r = r=oor=\inftyr=, not at an angle ϕ = π ϕ = π phi=pi\phi=\piϕ=π, but rather at
(40.8a) ϕ ( r = ) = π + 2 ( 1 + γ ) M / b (40.8a) ϕ ( r = ) = π + 2 ( 1 + γ ) M / b {:(40.8a)phi(r=oo)=pi+2(1+gamma)M_(o.)//b:}\begin{equation*} \phi(r=\infty)=\pi+2(1+\gamma) M_{\odot} / b \tag{40.8a} \end{equation*}(40.8a)ϕ(r=)=π+2(1+γ)M/b
Thus, the total deflection angle is
( angle of total deflection ) = 2 ( 1 + γ ) M / b (40.8b) = 1 2 ( 1 + γ ) 1 .75 for a ray that just grazes the sun. (  angle of total deflection  ) = 2 ( 1 + γ ) M / b (40.8b) = 1 2 ( 1 + γ ) 1 .75  for a ray that   just grazes the sun.  {:[(" angle of total deflection ")=2(1+gamma)M_(o.)//b],[(40.8b)=(1)/(2)(1+gamma)1^('').75" for a ray that "],[quad" just grazes the sun. "]:}\begin{align*} &(\text { angle of total deflection })=2(1+\gamma) M_{\odot} / b \\ &=\frac{1}{2}(1+\gamma) 1^{\prime \prime} .75 \text { for a ray that } \tag{40.8b}\\ & \quad \text { just grazes the sun. } \end{align*}( angle of total deflection )=2(1+γ)M/b(40.8b)=12(1+γ)1.75 for a ray that  just grazes the sun. 
But this is not the quantity of primary interest. Rather, one seeks the position of the star as seen by an astronomer on Earth. The angle α = π ϕ E + δ α α = π ϕ E + δ α alpha=pi-phi_(E)+delta alpha\alpha=\pi-\phi_{E}+\delta \alphaα=πϕE+δα between the sun and the star as measured by the astronomer is given by (see Figure 40.2)
tan ( π ϕ E + δ α ) = tan ϕ E + δ α / cos 2 ϕ E (40.9) = u ϕ ^ u r = [ ( 1 + γ M / r ) r d ϕ / d λ ( 1 + γ M / r ) d r / d λ ] E = [ r d ϕ d r ] E = [ ( b / r ) d ϕ d ( b / r ) ] E tan π ϕ E + δ α = tan ϕ E + δ α / cos 2 ϕ E (40.9) = u ϕ ^ u r = 1 + γ M / r r d ϕ / d λ 1 + γ M / r d r / d λ E = r d ϕ d r E = ( b / r ) d ϕ d ( b / r ) E {:[tan(pi-phi_(E)+delta alpha)=-tan phi_(E)+delta alpha//cos^(2)phi_(E)],[(40.9)=(u^( hat(phi)))/(u^(r))=[((1+gammaM_(o.)//r)rd phi//d lambda)/((1+gammaM_(o.)//r)dr//d lambda)]_(E)=[(rd phi)/(dr)]_(E)],[=-[((b//r)d phi)/(d(b//r))]_(E)]:}\begin{align*} \tan \left(\pi-\phi_{E}+\delta \alpha\right) & =-\tan \phi_{E}+\delta \alpha / \cos ^{2} \phi_{E} \\ & =\frac{u^{\hat{\phi}}}{u^{r}}=\left[\frac{\left(1+\gamma M_{\odot} / r\right) r d \phi / d \lambda}{\left(1+\gamma M_{\odot} / r\right) d r / d \lambda}\right]_{E}=\left[\frac{r d \phi}{d r}\right]_{E} \tag{40.9}\\ & =-\left[\frac{(b / r) d \phi}{d(b / r)}\right]_{E} \end{align*}tan(πϕE+δα)=tanϕE+δα/cos2ϕE(40.9)=uϕ^ur=[(1+γM/r)rdϕ/dλ(1+γM/r)dr/dλ]E=[rdϕdr]E=[(b/r)dϕd(b/r)]E
where u β = d x β / d λ u β = d x β / d λ u^(beta)=dx^(beta)//d lambdau^{\beta}=d x^{\beta} / d \lambdauβ=dxβ/dλ are the components of a tangent to the ray at the Earth. By inserting into this equation expression (40.6) for the trajectory of the ray, one obtains
(40.10) tan ϕ E δ α cos 2 ϕ E = sin ϕ E + [ ( 1 + γ ) M / b ] ( 1 cos ϕ E ) cos ϕ E + [ ( 1 + γ ) M / b ] sin ϕ E = tan ϕ E [ ( 1 + γ ) M / b ] ( 1 cos ϕ E ) / cos 2 ϕ E (40.10) tan ϕ E δ α cos 2 ϕ E = sin ϕ E + ( 1 + γ ) M / b 1 cos ϕ E cos ϕ E + ( 1 + γ ) M / b sin ϕ E = tan ϕ E ( 1 + γ ) M / b 1 cos ϕ E / cos 2 ϕ E {:[(40.10)tan phi_(E)-(delta alpha)/(cos^(2)phi_(E))=(sin phi_(E)+[(1+gamma)M_(o.)//b](1-cos phi_(E)))/(cos phi_(E)+[(1+gamma)M_(o.)//b]sin phi_(E))],[=tan phi_(E)-[(1+gamma)M_(o.)//b](1-cos phi_(E))//cos^(2)phi_(E)]:}\begin{align*} \tan \phi_{E}-\frac{\delta \alpha}{\cos ^{2} \phi_{E}} & =\frac{\sin \phi_{E}+\left[(1+\gamma) M_{\odot} / b\right]\left(1-\cos \phi_{E}\right)}{\cos \phi_{E}+\left[(1+\gamma) M_{\odot} / b\right] \sin \phi_{E}} \tag{40.10}\\ & =\tan \phi_{E}-\left[(1+\gamma) M_{\odot} / b\right]\left(1-\cos \phi_{E}\right) / \cos ^{2} \phi_{E} \end{align*}(40.10)tanϕEδαcos2ϕE=sinϕE+[(1+γ)M/b](1cosϕE)cosϕE+[(1+γ)M/b]sinϕE=tanϕE[(1+γ)M/b](1cosϕE)/cos2ϕE
Thus, the deflection angle measured at the Earth is
(40.11) δ α = ( 1 + γ ) M b ( 1 + cos α ) = ( 1 + γ ) M r E ( 1 + cos α 1 cos α ) 1 / 2 (40.11) δ α = ( 1 + γ ) M b ( 1 + cos α ) = ( 1 + γ ) M r E 1 + cos α 1 cos α 1 / 2 {:(40.11)delta alpha=((1+gamma)M_(o.))/(b)(1+cos alpha)=((1+gamma)M_(o.))/(r_(E))((1+cos alpha)/(1-cos alpha))^(1//2):}\begin{equation*} \delta \alpha=\frac{(1+\gamma) M_{\odot}}{b}(1+\cos \alpha)=\frac{(1+\gamma) M_{\odot}}{r_{E}}\left(\frac{1+\cos \alpha}{1-\cos \alpha}\right)^{1 / 2} \tag{40.11} \end{equation*}(40.11)δα=(1+γ)Mb(1+cosα)=(1+γ)MrE(1+cosα1cosα)1/2
It ranges from zero when the ray comes in opposite to the sun's direction ( α = π ) ( α = π ) (alpha=pi)(\alpha=\pi)(α=π), through the value
(40.12) ( 1 + γ ) M / r E = 1 2 ( 1 + γ ) 0 .0041 (40.12) ( 1 + γ ) M / r E = 1 2 ( 1 + γ ) 0 .0041 {:(40.12)(1+gamma)M_(o.)//r_(E)=(1)/(2)(1+gamma)0^('').0041:}\begin{equation*} (1+\gamma) M_{\odot} / r_{E}=\frac{1}{2}(1+\gamma) 0^{\prime \prime} .0041 \tag{40.12} \end{equation*}(40.12)(1+γ)M/rE=12(1+γ)0.0041
when the ray comes in perpendicular to the Earth-Sun line ( α = π / 2 α = π / 2 alpha=pi//2\alpha=\pi / 2α=π/2 ), to the "classical value" of 1 2 ( 1 + γ ) × 1 .75 1 2 ( 1 + γ ) × 1 .75 (1)/(2)(1+gamma)xx1^('').75\frac{1}{2}(1+\gamma) \times 1^{\prime \prime} .7512(1+γ)×1.75 when the ray comes in grazing the sun's limb.
All experiments to date (1972) have examined the case of grazing passage. The experimental results are stated and discussed in Box 40.1. They show that the PPN parameter γ γ gamma\gammaγ has its general relativistic value of 1 to within an uncertainty of about 20 percent.
By the middle or late 1970's, measurements of the deflection of radio waves from quasars should determine γ γ gamma\gammaγ to much better than 1 percent. Also, by that time radio astronomers may be making progress toward setting up high-precision coordinates on the sky using very long baseline interferometry. If so, they will have to use equation (40.11) to compensate for the "warping" of the coordinates caused by the sun's deflection of radio waves in all regions of the sky, not just near the solar limb.
(2) formula for deflection angle
Experimental measurements of deflection

Exercise 40.2. TRAJECTORY OF LIGHT RAY IN SUN'S GRAVITATIONAL FIELD

Derive equation (40.6) for the path of a light ray in isotropic coordinates (40.3) in the sun's "equatorial plane." Use one or more of three alternative approaches: (1) direct integration of the geodesic equation (the hardest approach!); (2) computation based on the three integrals of the motion
k k = 0 , k ( / t ) = k 0 , k ( / ϕ ) = k ϕ = b k 0 k d / d λ = tangent vector to geodesic k k = 0 , k ( / t ) = k 0 , k ( / ϕ ) = k ϕ = b k 0 k d / d λ =  tangent vector to geodesic  {:[k*k=0","quad k*(del//del t)=k_(0)","quad k*(del//del phi)=k_(phi)=-bk_(0)],[k-=d//d lambda=" tangent vector to geodesic "]:}\begin{gathered} \boldsymbol{k} \cdot \boldsymbol{k}=0, \quad \boldsymbol{k} \cdot(\partial / \partial t)=k_{0}, \quad \boldsymbol{k} \cdot(\partial / \partial \phi)=k_{\phi}=-b k_{0} \\ \boldsymbol{k} \equiv d / d \lambda=\text { tangent vector to geodesic } \end{gathered}kk=0,k(/t)=k0,k(/ϕ)=kϕ=bk0kd/dλ= tangent vector to geodesic 
(see § § 25.2 § § 25.2 §§25.2\S \S 25.2§§25.2 and 25.3); (3) computation based on the Hamilton-Jacobi method (Box 25.4), which for photons (zero rest mass) reduces to the "eikonal method" of geometric optics (see §22.5).

EXERCISE

§40.4. TIME-DELAY IN RADAR PROPAGATION

Another effect of spacetime curvature on electromagnetic waves is a relativistic delay in the round-trip travel time for radar signals. It was first pointed out by Shapiro (1964); see also Muhleman and Reichley ( 1964 , 1965 ) ( 1964 , 1965 ) (1964,1965)(1964,1965)(1964,1965).

Eclipse Measurements

Until 1968 every experiment measured the deflection of starlight during total eclipse of the sun. The measurements were beset by difficulties such as poor weather, optical distortions due to temperature changes, and the strange propensity of eclipses to attain maximum time of totality in jungles, in the middles of oceans, in deserts, and in arctic tundras. Lists of all the results and references are given by Bertotti, Brill and Krotkov (1962), and by Klüber (1960). Dicke (1964b) summarizes the results as follows:
"The analyses [of the experimental data] scatter from a deflection at the limb of the sun of 1.43 seconds of arc to 2.7 seconds [compared to a general relativistic value of 1.75 seconds]. The scatter would not be too bad if one could believe that the technique was free of systematic errors. It appears that one must consider this observation uncertain to at least 10 percent, and perhaps as much as 20 percent." This result corresponds to an uncertainty in γ γ gamma\gammaγ of 20 to 40 percent.

Measurements on the Deflection of Radio Waves

Each October 8 the sun, as seen from the Earth, passes in front of the quasar 3C279. By monitoring the angular separation between 3C279 and a nearby quasar 3C273, radio astronomers can measure the deflection by the sun of the 3C279 radio waves. The monitoring uses radio interferometers. [See references cited in table for discussion of the technique.] Technology of the early 1970's should permit measurements to a precision 0.001 seconds of arc or better, if the two ends of the interferometer are separated by several thousand kilometers ("transcontinental" or "transworld" baseline). But as of 1971 the only successful experiments were less ambitious: they used baselines of less than 10 kilometers. A summary of these pre-1971, short-baseline results is shown in the table.
The 90 -foot (background) and 130 -foot (foreground) radio interferometer system at Caltech's Owens Valley Radio Observatory. These were used by the deflection of quasar radio waves by the sun. During the experiment the two antennas were separated by 1.07 kilometers. (Photo by Alan Moffet.)
Dates of observation Observatory Experimenters and reference Number of telescopes and separations Wave lengths Experimental results a ^("a "){ }^{\text {a }}
1 2 ( 1 + γ ) = ( Observed deflection ) ( Einstein prediction ) 1 2 ( 1 + γ ) = (  Observed   deflection  ) (  Einstein   prediction  ) (1)/(2)(1+gamma)=(((" Observed ")/(" deflection ")))/(((" Einstein ")/(" prediction ")))\frac{1}{2}(1+\gamma)=\frac{\binom{\text { Observed }}{\text { deflection }}}{\binom{\text { Einstein }}{\text { prediction }}}12(1+γ)=( Observed  deflection )( Einstein  prediction ) Formal standard error Onesigma error
Sept. 30 Oct. 15 1969  Sept.  30  Oct.  15 1969 {:[" Sept. "30-" Oct. "15],[1969]:}\begin{gathered} \text { Sept. } 30-\text { Oct. } 15 \\ 1969 \end{gathered} Sept. 30 Oct. 151969 Owens Valley (Caltech) Seielstadt, Sramek, Weiler (1970) 2 , 1.07 km 2 , 1.07 km {:[2","],[1.07km]:}\begin{gathered} 2, \\ 1.07 \mathrm{~km} \end{gathered}2,1.07 km 3.1 cm 1.01 ± 0.12 ± 0.12 +-0.12\pm 0.12±0.12 ± 0.12 ± 0.12 +-0.12\pm 0.12±0.12
Oct. 2-Oct. 10 1969  Oct. 2-Oct.  10 1969 {:[" Oct. 2-Oct. "10],[1969]:}\begin{aligned} & \text { Oct. 2-Oct. } 10 \\ & 1969 \end{aligned} Oct. 2-Oct. 101969 Goldstone (Caltech-JPL)
Muhleman, Ekers,
Fomalont (1970)
Muhleman, Ekers, Fomalont (1970)| Muhleman, Ekers, | | :--- | | Fomalont (1970) |
2 , 21.56 km 2 , 21.56 km {:[2","],[21.56km]:}\begin{gathered} 2, \\ 21.56 \mathrm{~km} \end{gathered}2,21.56 km 12.5 cm 1.04 ± 0.05 ± 0.05 +-0.05\pm 0.05±0.05 + 0.15 0.10 + 0.15 0.10 {:[+0.15],[-0.10]:}\begin{aligned} & +0.15 \\ & -0.10 \end{aligned}+0.150.10
Oct. 2 -Oct. 12 1970  Oct.  2 -Oct.  12 1970 {:[" Oct. "2"-Oct. "12],[1970]:}\begin{gathered} \text { Oct. } 2 \text {-Oct. } 12 \\ 1970 \end{gathered} Oct. 2-Oct. 121970 National Radio Astronomy Observatory (USA) Sramek (1971) 3 , 0.80 km , 1.90 km , 2.70 km 3 , 0.80 km , 1.90 km , 2.70 km {:[3","],[0.80km","1.90km","],[2.70km]:}\begin{gathered} 3, \\ 0.80 \mathrm{~km}, 1.90 \mathrm{~km}, \\ 2.70 \mathrm{~km} \end{gathered}3,0.80 km,1.90 km,2.70 km 11.1 cm 3.7 cm 11.1 cm 3.7 cm {:[11.1cm],[3.7cm]:}\begin{array}{r} 11.1 \mathrm{~cm} \\ 3.7 \mathrm{~cm} \end{array}11.1 cm3.7 cm 0.90 ± 0.05 ± 0.05 +-0.05\pm 0.05±0.05 ± 0.05 ± 0.05 +-0.05\pm 0.05±0.05
Sept. 30 Oct. 15 1970  Sept.  30  Oct.  15 1970 {:[" Sept. "30-" Oct. "15],[1970]:}\begin{gathered} \text { Sept. } 30-\text { Oct. } 15 \\ 1970 \end{gathered} Sept. 30 Oct. 151970
Mullard Radio
Astronomy
Observatory (Cambridge Univ.)
Mullard Radio Astronomy Observatory (Cambridge Univ.)| Mullard Radio | | :--- | | Astronomy | | Observatory (Cambridge Univ.) |
Hill (1971) 3 , 0.66 km , 1.41 km 3 , 0.66 km , 1.41 km {:[3","],[0.66km","1.41km]:}\begin{gathered} 3, \\ 0.66 \mathrm{~km}, 1.41 \mathrm{~km} \end{gathered}3,0.66 km,1.41 km 11.1 cm 6.0 cm 11.1 cm 6.0 cm {:[11.1cm],[6.0cm]:}\begin{array}{r} 11.1 \mathrm{~cm} \\ 6.0 \mathrm{~cm} \end{array}11.1 cm6.0 cm 1.07 ± 0.17 ± 0.17 +-0.17\pm 0.17±0.17 ± 0.17 ± 0.17 +-0.17\pm 0.17±0.17
Dates of observation Observatory Experimenters and reference Number of telescopes and separations Wave lengths Experimental results ^("a ") (1)/(2)(1+gamma)=(((" Observed ")/(" deflection ")))/(((" Einstein ")/(" prediction "))) Formal standard error Onesigma error " Sept. 30- Oct. 15 1969" Owens Valley (Caltech) Seielstadt, Sramek, Weiler (1970) "2, 1.07km" 3.1 cm 1.01 +-0.12 +-0.12 " Oct. 2-Oct. 10 1969" Goldstone (Caltech-JPL) "Muhleman, Ekers, Fomalont (1970)" "2, 21.56km" 12.5 cm 1.04 +-0.05 "+0.15 -0.10" " Oct. 2-Oct. 12 1970" National Radio Astronomy Observatory (USA) Sramek (1971) "3, 0.80km,1.90km, 2.70km" "11.1cm 3.7cm" 0.90 +-0.05 +-0.05 " Sept. 30- Oct. 15 1970" "Mullard Radio Astronomy Observatory (Cambridge Univ.)" Hill (1971) "3, 0.66km,1.41km" "11.1cm 6.0cm" 1.07 +-0.17 +-0.17| Dates of observation | Observatory | Experimenters and reference | Number of telescopes and separations | Wave lengths | Experimental results ${ }^{\text {a }}$ | | | | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | | | | | | | $\frac{1}{2}(1+\gamma)=\frac{\binom{\text { Observed }}{\text { deflection }}}{\binom{\text { Einstein }}{\text { prediction }}}$ | Formal standard error | Onesigma error | | $\begin{gathered} \text { Sept. } 30-\text { Oct. } 15 \\ 1969 \end{gathered}$ | Owens Valley (Caltech) | Seielstadt, Sramek, Weiler (1970) | $\begin{gathered} 2, \\ 1.07 \mathrm{~km} \end{gathered}$ | 3.1 cm | 1.01 | $\pm 0.12$ | $\pm 0.12$ | | $\begin{aligned} & \text { Oct. 2-Oct. } 10 \\ & 1969 \end{aligned}$ | Goldstone (Caltech-JPL) | Muhleman, Ekers, <br> Fomalont (1970) | $\begin{gathered} 2, \\ 21.56 \mathrm{~km} \end{gathered}$ | 12.5 cm | 1.04 | $\pm 0.05$ | $\begin{aligned} & +0.15 \\ & -0.10 \end{aligned}$ | | $\begin{gathered} \text { Oct. } 2 \text {-Oct. } 12 \\ 1970 \end{gathered}$ | National Radio Astronomy Observatory (USA) | Sramek (1971) | $\begin{gathered} 3, \\ 0.80 \mathrm{~km}, 1.90 \mathrm{~km}, \\ 2.70 \mathrm{~km} \end{gathered}$ | $\begin{array}{r} 11.1 \mathrm{~cm} \\ 3.7 \mathrm{~cm} \end{array}$ | 0.90 | $\pm 0.05$ | $\pm 0.05$ | | $\begin{gathered} \text { Sept. } 30-\text { Oct. } 15 \\ 1970 \end{gathered}$ | Mullard Radio <br> Astronomy <br> Observatory (Cambridge Univ.) | Hill (1971) | $\begin{gathered} 3, \\ 0.66 \mathrm{~km}, 1.41 \mathrm{~km} \end{gathered}$ | $\begin{array}{r} 11.1 \mathrm{~cm} \\ 6.0 \mathrm{~cm} \end{array}$ | 1.07 | $\pm 0.17$ | $\pm 0.17$ |
a ^("a "){ }^{\text {a }} Here (observed deflection)/(Einstein prediction) is the number 3 2 ( 1 + γ ) 3 2 ( 1 + γ ) (3)/(2)(1+gamma)\frac{3}{2}(1+\gamma)32(1+γ) obtained by fitting the observational data to the PPN prediction (40.11). [For these experiments the ray passes near the solar limb; so (40.11) reduces to δ α = 1 2 ( 1 + γ ) ( M / b ) δ α = 1 2 ( 1 + γ ) M / b delta alpha=(1)/(2)(1+gamma)(M_(o.)//b)\delta \alpha=\frac{1}{2}(1+\gamma)\left(M_{\odot} / b\right)δα=12(1+γ)(M/b).] The "formal standard error" is obtained from the data by standard statistical techniques. However, it is not usually a good measure of the certainty of the result, because it fails to take account of systematic errors. The quoted "one-sigma error" is the experimenters' best estimate of the combined statistical and systematic
uncertainties. The experimenters estimate a probability of 68 percent that the true result is within "lo" of their measured value; a probability of 95 percent that it is within " 2 σ 2 σ 2sigma2 \sigma2σ "; etc.
(1) foundations for calculation; Fermat's principle
(2) details of calculation
Figure 40.3.
Diagram, in the PPN coordinate system, for the calculation of the relativistic time delay.
Let a radar transmitter on Earth send a radio wave out to a reflector elsewhere in the solar system, and let the reflector return the wave to Earth. Calculate the round-trip travel time, as measured by a clock on Earth. For simplicity of calculation, idealize both Earth and reflector as nonrotating and as at rest in the static, spherical gravitational field of the Sun. At the end of the calculation, the effects of rotation and motion will be discussed separately. Also ignore time dilation of the transmitter's clock due to the Earth's gravitational field; it is easily corrected for, and it is so small that it will not come into play in these radar experiments before the middle or late 1970's. The gravitational effects of the other planets on the radio waves are too small to be discernible in the foreseeable future, unless the beam grazes the limb of one of them. However, the effects of dispersion in the solar wind and corona are discernible and must be corrected for. These corrections will not be discussed here, since they are free of any general-relativistic influence.
The calculation of the round-trip travel time can be simplified by using a generalrelativistic version of Fermat's principle: In any static field ( g 0 j = 0 , g α β , 0 = 0 g 0 j = 0 , g α β , 0 = 0 g_(0j)=0,g_(alpha beta,0)=0g_{0 j}=0, g_{\alpha \beta, 0}=0g0j=0,gαβ,0=0 ) consider all null curves between two points in space, x j = a j x j = a j x^(j)=a^(j)x^{j}=a^{j}xj=aj and x j = b j x j = b j x^(j)=b^(j)x^{j}=b^{j}xj=bj. Each such null curve, x j ( t ) x j ( t ) x^(j)(t)x^{j}(t)xj(t), requires a particular coordinate time interval Δ t Δ t Delta t\Delta tΔt to get from a j a j a^(j)a^{j}aj to b j b j b^(j)b^{j}bj. The curves of extremal Δ t Δ t Delta t\Delta tΔt are the null geodesics of spacetime. The proof of this theorem is outlined in exercise 40.3.
Because of Fermat's principle, the lapse of coordinate time between transmission of the radar beam and reflection at the reflector, t TR t TR t_(TR)t_{\mathrm{TR}}tTR, is the same for a straight path in the PPN coordinates as for the slightly curved path which the beam actually follows. (The two differ by a fractional amount Δ t TR / t TR Δ t TR / t TR Deltat_(TR)//t_(TR)∼\Delta t_{\mathrm{TR}} / t_{\mathrm{TR}} \simΔtTR/tTR (angle of deflection) 2 10 12 2 10 12 ^(2) <= 10^(-12){ }^{2} \leqq 10^{-12}21012, which is far from discernable.) Hence, in the computation one can ignore the gravitational bending of the beam.
Adopt Cartesian PPN coordinates with the sun at the origin; the transmitter, sun, and reflector in the z = 0 z = 0 z=0z=0z=0 "plane"; and the transmitter-reflector line along the x x xxx direction (see Figure 40.3). The transmitter is at ( x , y ) = ( a T , b ) ( x , y ) = a T , b (x,y)=(-a_(T),b)(x, y)=\left(-a_{T}, b\right)(x,y)=(aT,b) in the PPN coordinates, and the reflector is at ( x , y ) = ( a R , b ) ( x , y ) = a R , b (x,y)=(a_(R),b)(x, y)=\left(a_{R}, b\right)(x,y)=(aR,b). Recall that for a null ray d s 2 = 0 = d s 2 = 0 = ds^(2)=0=d s^{2}=0=ds2=0=
g 00 d t 2 g x x d x 2 g 00 d t 2 g x x d x 2 g_(00)dt^(2)-g_(xx)dx^(2)g_{00} d t^{2}-g_{x x} d x^{2}g00dt2gxxdx2. It follows that the lapse of coordinate time between transmission and reflection is
t TR = a T a R ( g x x g 00 ) 1 / 2 d x = a T a R [ 1 + ( 1 + γ ) M x 2 + b 2 ] d x t TR = a T a R g x x g 00 1 / 2 d x = a T a R 1 + ( 1 + γ ) M x 2 + b 2 d x t_(TR)=int_(-a_(T))^(a_(R))((g_(xx))/(-g_(00)))^(1//2)dx=int_(-a_(T))^(a_(R))[1+((1+gamma)M_(o.))/(sqrt(x^(2)+b^(2)))]dxt_{\mathrm{TR}}=\int_{-a_{T}}^{a_{R}}\left(\frac{g_{x x}}{-g_{00}}\right)^{1 / 2} d x=\int_{-a_{T}}^{a_{R}}\left[1+\frac{(1+\gamma) M_{\odot}}{\sqrt{x^{2}+b^{2}}}\right] d xtTR=aTaR(gxxg00)1/2dx=aTaR[1+(1+γ)Mx2+b2]dx
Integration yields
(40.13) t TR = a R + a T + ( 1 + γ ) M ln [ ( a R + a R 2 + b 2 ) ( a T + a T 2 + b 2 ) b 2 ] . (40.13) t TR = a R + a T + ( 1 + γ ) M ln a R + a R 2 + b 2 a T + a T 2 + b 2 b 2 . {:(40.13)t_(TR)=a_(R)+a_(T)+(1+gamma)M_(o.)ln[((a_(R)+sqrt(a_(R)^(2)+b^(2)))(a_(T)+sqrt(a_(T)^(2)+b^(2))))/(b^(2))].:}\begin{equation*} t_{\mathrm{TR}}=a_{R}+a_{T}+(1+\gamma) M_{\odot} \ln \left[\frac{\left(a_{R}+\sqrt{a_{R}^{2}+b^{2}}\right)\left(a_{T}+\sqrt{a_{T}^{2}+b^{2}}\right)}{b^{2}}\right] . \tag{40.13} \end{equation*}(40.13)tTR=aR+aT+(1+γ)Mln[(aR+aR2+b2)(aT+aT2+b2)b2].
The lapse of coordinate time in round-trip travel has twice this magnitude. The lapse of proper time measured by an Earth-based clock is
Δ τ = | g 00 | Earth 2 t TR , Δ τ = 2 ( a R + a T ) ( 1 M a T 2 + b 2 ) (40.14) + 2 ( 1 + γ ) M ln [ ( a R + a R 2 + b 2 ) ( a T + a T 2 + b 2 ) b 2 ] Δ τ = g 00 Earth 2 t TR , Δ τ = 2 a R + a T 1 M a T 2 + b 2 (40.14) + 2 ( 1 + γ ) M ln a R + a R 2 + b 2 a T + a T 2 + b 2 b 2 {:[Delta tau=|g_(00)|_(Earth)2t_(TR)","],[Delta tau=2(a_(R)+a_(T))(1-(M_(o.))/(sqrt(a_(T)^(2)+b^(2))))],[(40.14)+2(1+gamma)M_(o.)ln[((a_(R)+sqrt(a_(R)^(2)+b^(2)))(a_(T)+sqrt(a_(T)^(2)+b^(2))))/(b^(2))]]:}\begin{align*} \Delta \tau= & \left|g_{00}\right|_{\mathrm{Earth}} 2 t_{\mathrm{TR}}, \\ \Delta \tau= & 2\left(a_{R}+a_{T}\right)\left(1-\frac{M_{\odot}}{\sqrt{a_{T}^{2}+b^{2}}}\right) \\ & +2(1+\gamma) M_{\odot} \ln \left[\frac{\left(a_{R}+\sqrt{a_{R}^{2}+b^{2}}\right)\left(a_{T}+\sqrt{a_{T}^{2}+b^{2}}\right)}{b^{2}}\right] \tag{40.14} \end{align*}Δτ=|g00|Earth2tTR,Δτ=2(aR+aT)(1MaT2+b2)(40.14)+2(1+γ)Mln[(aR+aR2+b2)(aT+aT2+b2)b2]
(3) formula for delay
This is the lapse of time on an Earth-based clock, aside from corrections for the orbital and rotational motion of the Earth, for the orbital motion of the reflector, for dispersion of radiation traversing the solar wind and corona, and for time dilation in the Earth's gravitational field.
Any reader is reasonable who objects to the form (40.14) in which the time-delay has been written. The quantities a R , a T a R , a T a_(R),a_(T)a_{R}, a_{T}aR,aT, and b b bbb are coordinate positions in the PPN coordinate system, rather than numbers the astronomer can measure directly. They differ from coordinate positions in other, equally good coordinate systems by amounts of the order of M 1.5 km M 1.5 km M_(o.)∼1.5kmM_{\odot} \sim 1.5 \mathrm{~km}M1.5 km. The objection is not mathematical in its origin. The quantities a R , a T a R , a T a_(R),a_(T)a_{R}, a_{T}aR,aT, and b b bbb are perfectly well-defined [with post-post-Newtonian uncertainties of order b ( M / b ) 2 10 6 km ] b M / b 2 10 6 km {:b(M_(o.)//b)^(2) <= 10^(-6)(km)]\left.b\left(M_{\odot} / b\right)^{2} \leqq 10^{-6} \mathrm{~km}\right]b(M/b)2106 km], because the PPN coordinate system is perfectly well-defined. But they are not quantities which the experimenter can measure directly, with precision anywhere near that required to test the relativistic terms in the time-delay formula (40.14).
In practice, fortunately, the experimenter does not need to measure a R , a T a R , a T a_(R),a_(T)a_{R}, a_{T}aR,aT, or b b bbb with high precision. Instead, he checks the time-delay formula by measuring the changes in Δ τ Δ τ Delta tau\Delta \tauΔτ as the Earth and reflector move in their orbits about the Sun; i.e., he measures Δ τ Δ τ Delta tau\Delta \tauΔτ as a function of Earth-based time τ τ tau\tauτ. Notice that when the beam is passing near the sun ( b a R , b a T b a R , b a T (b≪a_(R),b≪a_(T):}\left(b \ll a_{R}, b \ll a_{T}\right.(baR,baT; but d b / d τ d a R / d τ d b / d τ d a R / d τ db//d tau≫da_(R)//d taud b / d \tau \gg d a_{R} / d \taudb/dτdaR/dτ and d b / d τ d a T / d τ d b / d τ d a T / d τ db//d tau≫da_(T)//d taud b / d \tau \gg d a_{T} / d \taudb/dτdaT/dτ because the Earth's and reflector's orbits are nearly circular), the change of b b bbb in the ln ln ln\lnln term of (40.14) dominates all other relativistic corrections to the Newtonian delay; consequently (using d b / d τ 10 km / sec d b / d τ 10 km / sec db//d tau∼10km//secd b / d \tau \sim 10 \mathrm{~km} / \mathrm{sec}db/dτ10 km/sec for typical experiments)
(40.15) d Δ τ d τ ( Constant Newtonian part ) 4 ( 1 + γ ) M b d b d τ 4 ( 1 + γ ) ( 1.5 km 10 6 km ) ( 10 km sec ) 30 μ sec day (40.15) d Δ τ d τ (  Constant Newtonian   part  ) 4 ( 1 + γ ) M b d b d τ 4 ( 1 + γ ) 1.5 km 10 6 km 10 km sec 30 μ sec  day  {:[(40.15)(d Delta tau)/(d tau)-((" Constant Newtonian ")/(" part "))~~-4(1+gamma)(M_(o.))/(b)(db)/(d tau)],[∼4(1+gamma)((1.5(km))/(10^(6)(km)))((10(km))/(sec))∼(30 musec)/(" day ")]:}\begin{align*} \frac{d \Delta \tau}{d \tau}-\binom{\text { Constant Newtonian }}{\text { part }} \approx-4(1 & +\gamma) \frac{M_{\odot}}{b} \frac{d b}{d \tau} \tag{40.15}\\ & \sim 4(1+\gamma)\left(\frac{1.5 \mathrm{~km}}{10^{6} \mathrm{~km}}\right)\left(\frac{10 \mathrm{~km}}{\mathrm{sec}}\right) \sim \frac{30 \mu \mathrm{sec}}{\text { day }} \end{align*}(40.15)dΔτdτ( Constant Newtonian  part )4(1+γ)Mbdbdτ4(1+γ)(1.5 km106 km)(10 kmsec)30μsec day 
(4) comparison with experiment
Such differential shifts in round-trip travel time-which rise as the Earth-reflector line moves toward the Sun and falls as it moves away-are readily observable.
In practice, in order to obtain precisions better than about 20 percent in the determination of the parameter γ γ gamma\gammaγ by time-delay measurements, one must carefully collect and analyze data for a large fraction of a year-from a time when the beam is far from the sun ( b a T 10 8 km ) b a T 10 8 km (b∼a_(T)∼10^(8)(km))\left(b \sim a_{T} \sim 10^{8} \mathrm{~km}\right)(baT108 km), to the time of superior conjunction ( b R b R b∼R_(o.)b \sim R_{\odot}bR 10 6 km 10 6 km ∼10^(6)km\sim 10^{6} \mathrm{~km}106 km ), and on around to a time of distant beam again. Such a long "arc" of data is needed to determine the reflector's orbit with high precision, and to take full advantage of the slow, logarithmic falloff of Δ τ Δ τ Delta tau\Delta \tauΔτ with b b bbb (40.14). When the beam is far from the sun ( b R ) b R (b≫R_(o.))\left(b \gg R_{\odot}\right)(bR), the simplifying assumptions behind equation (40.15) are not valid; and the relativistic time-delay gets intertwined with the orbital motions of the Earth and the reflector. The analysis then remains straightforward, but its details are so complex that one resorts to numerical integrations on a computer to carry it out. Because the orbital motions enter, the time-delay data then contain information about other metric parameters ( β β beta\betaβ is the dominant one) in addition to γ γ gamma\gammaγ.
The experimental results as of 1971 are described in Box 40.2. They yield a value for the PPN parameter γ γ gamma\gammaγ that is more accurate than the value from light and radio-wave deflection experiments:
(40.16) γ = 1.02 ± 0.08 (40.16) γ = 1.02 ± 0.08 {:(40.16)gamma=1.02+-0.08:}\begin{equation*} \gamma=1.02 \pm 0.08 \tag{40.16} \end{equation*}(40.16)γ=1.02±0.08
Future experiments using spacecraft may improve the precision of γ γ gamma\gammaγ to ± 0.001 ± 0.001 +-0.001\pm 0.001±0.001 or better.

EXERCISE

Exercise 40.3. FERMAT'S PRINCIPLE

Prove Fermat's principle for a static gravitational field. [Hint: The proof might proceed as follows. Write down the geodesic equation in four-dimensional spacetime using an affine parameter λ λ lambda\lambdaλ. Convert from the parameter λ λ lambda\lambdaλ to coordinate time t t ttt, and use d s 2 = 0 d s 2 = 0 ds^(2)=0d s^{2}=0ds2=0 to obtain
g j k d 2 x k d t 2 + Γ j k d x k d t d x d t Γ j 00 g k g 00 d x k d t d x d t + d 2 t / d λ 2 ( d t / d λ ) 2 g j k d x k d t = 0 g j k d 2 x k d t 2 + Γ j k d x k d t d x d t Γ j 00 g k g 00 d x k d t d x d t + d 2 t / d λ 2 ( d t / d λ ) 2 g j k d x k d t = 0 g_(jk)(d^(2)x^(k))/(dt^(2))+Gamma_(jkℓ)(dx^(k))/(dt)(dx^(ℓ))/(dt)-Gamma_(j 00)(g_(kℓ))/(g_(00))(dx^(k))/(dt)(dx^(ℓ))/(dt)+(d^(2)t//dlambda^(2))/((dt//d lambda)^(2))g_(jk)(dx^(k))/(dt)=0g_{j k} \frac{d^{2} x^{k}}{d t^{2}}+\Gamma_{j k \ell} \frac{d x^{k}}{d t} \frac{d x^{\ell}}{d t}-\Gamma_{j 00} \frac{g_{k \ell}}{g_{00}} \frac{d x^{k}}{d t} \frac{d x^{\ell}}{d t}+\frac{d^{2} t / d \lambda^{2}}{(d t / d \lambda)^{2}} g_{j k} \frac{d x^{k}}{d t}=0gjkd2xkdt2+ΓjkdxkdtdxdtΓj00gkg00dxkdtdxdt+d2t/dλ2(dt/dλ)2gjkdxkdt=0
Combine with the time part of the geodesic equation
d 2 t / d λ 2 ( d t / d λ ) 2 = 2 Γ 0 k 0 d x k / d t g 00 d 2 t / d λ 2 ( d t / d λ ) 2 = 2 Γ 0 k 0 d x k / d t g 00 (d^(2)t//dlambda^(2))/((dt//d lambda)^(2))=-2Gamma_(0k0)(dx^(k)//dt)/(g_(00))\frac{d^{2} t / d \lambda^{2}}{(d t / d \lambda)^{2}}=-2 \Gamma_{0 k 0} \frac{d x^{k} / d t}{g_{00}}d2t/dλ2(dt/dλ)2=2Γ0k0dxk/dtg00
and use the expression for the Christoffel symbols in terms of the metric to obtain
γ j k d 2 x k d t 2 + 1 2 ( γ j k , + γ j l , k γ k l , j ) d x k d t d x d t = 0 , γ j k g j k g 00 γ j k d 2 x k d t 2 + 1 2 γ j k , + γ j l , k γ k l , j d x k d t d x d t = 0 , γ j k g j k g 00 gamma_(jk)(d^(2)x^(k))/(dt^(2))+(1)/(2)(gamma_(jk,ℓ)+gamma_(jl,k)-gamma_(kl,j))(dx^(k))/(dt)(dx^(ℓ))/(dt)=0,quadgamma_(jk)-=-(g_(jk))/(g_(00))\gamma_{j k} \frac{d^{2} x^{k}}{d t^{2}}+\frac{1}{2}\left(\gamma_{j k, \ell}+\gamma_{j l, k}-\gamma_{k l, j}\right) \frac{d x^{k}}{d t} \frac{d x^{\ell}}{d t}=0, \quad \gamma_{j k} \equiv-\frac{g_{j k}}{g_{00}}γjkd2xkdt2+12(γjk,+γjl,kγkl,j)dxkdtdxdt=0,γjkgjkg00
Then notice that this is a geodesic equation with affine parameter t t ttt in a three-dimensional manifold with metric γ j k γ j k gamma_(jk)\gamma_{j k}γjk. The familiar extremum principle for this geodesic is
δ a j b j ( γ j k d x j d x k ) 1 / 2 = δ a j b i d t = 0 , δ a j b j γ j k d x j d x k 1 / 2 = δ a j b i d t = 0 , deltaint_(a^(j))^(b^(j))(gamma_(jk)dx^(j)dx^(k))^(1//2)=deltaint_(a^(j))^(b^(i))dt=0,\delta \int_{a^{j}}^{b^{j}}\left(\gamma_{j k} d x^{j} d x^{k}\right)^{1 / 2}=\delta \int_{a^{j}}^{b^{i}} d t=0,δajbj(γjkdxjdxk)1/2=δajbidt=0,
which is precisely Fermat's principle!]

Box 40.2 RADAR TIME DELAY IN THE SOLAR SYSTEM: EXPERIMENTAL RESULTS

Two types of experiments have been performed to measure the relativistic effects [proportional to 1 2 ( 1 + γ ) 1 2 ( 1 + γ ) (1)/(2)(1+gamma)\frac{1}{2}(1+\gamma)12(1+γ); equation (40.14)] in the round-trip radar travel time in the solar system. In one type ("passive" experiment) the reflector is the surface of the planet Venus or the planet Mercury. In the other type ("active" experiment) the "reflector" is electronic equipment on board a spacecraft that receives the signal and transmits it back to Earth ("transponder"). Passive experiments suffer from noise due to topography of the reflecting planet (earlier radar return from mountain tops than from valley floors), and they suffer from weakness of the returned signal. Active experiments suffer from buffeting of the spacecraft by solar wind, buffeting by fluctuations in solar radiation pressure, and buffeting by leakage from gas jets ("outgassing"). Experiments of the future will solve these problems by placing a transponder on the surface of a planet or on a "drag-free" (buffetingfree) spacecraft. But experiments of the present and future must both contend with fluctuating time delays due to dispersion in the fluctuating solar wind and corona. Fortunately, these are smaller than the relativistic effects, except when
The Mariner VI spacecraft (mock-up), which was the reflector in a 1970 measurement of 1 2 ( 1 + γ ) 1 2 ( 1 + γ ) (1)/(2)(1+gamma)\frac{1}{2}(1+\gamma)12(1+γ) by radar time delay [photo courtesy the Caltech Jet Propulsion Laboratory].
the beam passes within 2 or 3 solar radii of the sun.
The results of experiments performed before 1972 are listed in the table.
Dates of observation Radar telescopes Reflector
Experimenters
and reference
Experimenters and reference| Experimenters | | :--- | | and reference |
Experimental result a ^("a "){ }^{\text {a }}
1 2 ( 1 + γ ) = 1 2 ( 1 + γ ) = (1)/(2)(1+gamma)=\frac{1}{2}(1+\gamma)=12(1+γ)= Formal standard error Onesigma error
Wave length ( Observed delay ) ( Einstein prediction ) (  Observed   delay  ) (  Einstein   prediction  ) (((" Observed ")/(" delay ")))/(((" Einstein ")/(" prediction ")))\frac{\binom{\text { Observed }}{\text { delay }}}{\binom{\text { Einstein }}{\text { prediction }}}( Observed  delay )( Einstein  prediction )
November 1966 to  November  1966  to  {:[" November "1966],[" to "]:}\begin{aligned} & \text { November } 1966 \\ & \text { to } \end{aligned} November 1966 to  Haystack (MIT) Venus and Mercury Shapiro (1968) 3.8 cm 0.9 ± 0.2 ± 0.2 +-0.2\pm 0.2±0.2
August 1967
1967 1967 196719671967
through
1967 through| $1967$ | | :--- | | through |
Haystack (MIT), and Venus and Mercury Shapiro, Ash, et al. (1971) 3.8 cm 3.8 cm 3.8cm3.8 \mathrm{~cm}3.8 cm and 1.015 ± 0.02 ± 0.02 +-0.02\pm 0.02±0.02 ± 0.05 ± 0.05 +-0.05\pm 0.05±0.05
through 1970 1970 197019701970
and
Arecibo (Cornell)
and Arecibo (Cornell)| and | | :--- | | Arecibo (Cornell) |
Mercury (1971) and 70 cm .  and  70 cm {:[" and "],[70cm". "]:}\begin{gathered} \text { and } \\ 70 \mathrm{~cm} \text {. } \end{gathered} and 70 cm
October 1969 Deep Space Mariner VI Anderson, et al. 14 cm . 1.00 ± 0.014 ± 0.014 +-0.014\pm 0.014±0.014 ± 0.04 ± 0.04 +-0.04\pm 0.04±0.04
to Network and VII (1971)
January 1971 (NASA) spacecraft
Dates of observation Radar telescopes Reflector "Experimenters and reference" Experimental result ^("a ") (1)/(2)(1+gamma)= Formal standard error Onesigma error Wave length (((" Observed ")/(" delay ")))/(((" Einstein ")/(" prediction "))) " November 1966 to " Haystack (MIT) Venus and Mercury Shapiro (1968) 3.8 cm 0.9 +-0.2 August 1967 "1967 through" Haystack (MIT), and Venus and Mercury Shapiro, Ash, et al. (1971) 3.8cm and 1.015 +-0.02 +-0.05 through 1970 "and Arecibo (Cornell)" Mercury (1971) " and 70cm. " October 1969 Deep Space Mariner VI Anderson, et al. 14 cm . 1.00 +-0.014 +-0.04 to Network and VII (1971) January 1971 (NASA) spacecraft | Dates of observation | Radar telescopes | Reflector | Experimenters <br> and reference | Experimental result ${ }^{\text {a }}$ | | | | | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | | | | | | $\frac{1}{2}(1+\gamma)=$ | | Formal standard error | Onesigma error | | | | | | Wave length | $\frac{\binom{\text { Observed }}{\text { delay }}}{\binom{\text { Einstein }}{\text { prediction }}}$ | | | | | | | | | | | | | $\begin{aligned} & \text { November } 1966 \\ & \text { to } \end{aligned}$ | Haystack (MIT) | Venus and Mercury | Shapiro (1968) | 3.8 cm | 0.9 | | $\pm 0.2$ | | August 1967 | | | | | | | | | $1967$ <br> through | Haystack (MIT), and | Venus and Mercury | Shapiro, Ash, et al. (1971) | $3.8 \mathrm{~cm}$ and | 1.015 | $\pm 0.02$ | $\pm 0.05$ | | through $1970$ | and <br> Arecibo (Cornell) | Mercury | (1971) | $\begin{gathered} \text { and } \\ 70 \mathrm{~cm} \text {. } \end{gathered}$ | | | | | October 1969 | Deep Space | Mariner VI | Anderson, et al. | 14 cm . | 1.00 | $\pm 0.014$ | $\pm 0.04$ | | to | Network | and VII | (1971) | | | | | | January 1971 | (NASA) | spacecraft | | | | | |

§40.5. PERIHELION SHIFT AND PERIODIC PERTURBATIONS IN GEODESIC ORBITS

Perihelion shift for geodesic orbits around spherical sun, ignoring preferred-frame effects
The light-deflection and time-delay experiments both measured γ γ gamma\gammaγ. To measure other PPN parameters, one must examine the effects of gravity on slowly moving bodies; this was the message of § 40.2 § 40.2 §40.2\S 40.2§40.2.
Begin with the simplest of cases: the geodesic orbit of a test body in the sun's spherical gravitational field, ignoring all gravitational effects of the planets, of solar oblateness, and of motion relative to any preferred frame. The PPN metric then has the form (40.3):
d s 2 = [ 1 2 M r + 2 β M 2 r 2 ] d t 2 (40.3) + [ 1 + 2 γ M r ] [ d r 2 + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) ] d s 2 = 1 2 M r + 2 β M 2 r 2 d t 2 (40.3) + 1 + 2 γ M r d r 2 + r 2 d θ 2 + sin 2 θ d ϕ 2 {:[ds^(2)=-[1-2(M_(o.))/(r)+2beta(M_(o.)^(2))/(r^(2))]dt^(2)],[(40.3)+[1+2gamma(M_(o.))/(r)][dr^(2)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))]]:}\begin{align*} d s^{2}= & -\left[1-2 \frac{M_{\odot}}{r}+2 \beta \frac{M_{\odot}^{2}}{r^{2}}\right] d t^{2} \\ & +\left[1+2 \gamma \frac{M_{\odot}}{r}\right]\left[d r^{2}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right)\right] \tag{40.3} \end{align*}ds2=[12Mr+2βM2r2]dt2(40.3)+[1+2γMr][dr2+r2(dθ2+sin2θdϕ2)]
Orient the coordinates so the test body moves in the equatorial "plane" θ = π / 2 θ = π / 2 theta=pi//2\theta=\pi / 2θ=π/2; and calculate the shape r ( ϕ ) r ( ϕ ) r(phi)r(\phi)r(ϕ) of its nearly Keplerian, nearly elliptical geodesic orbit. The result, accurate to order M / r M / r M_(o.)//rM_{\odot} / rM/r beyond Newtonian theory, is
(40.17) r = ( 1 e 2 ) a 1 + e cos [ ( 1 δ ϕ 0 / 2 π ) ϕ ] (40.17) r = 1 e 2 a 1 + e cos 1 δ ϕ 0 / 2 π ϕ {:(40.17)r=((1-e^(2))a)/(1+e cos[(1-deltaphi_(0)//2pi)phi]):}\begin{equation*} r=\frac{\left(1-e^{2}\right) a}{1+e \cos \left[\left(1-\delta \phi_{0} / 2 \pi\right) \phi\right]} \tag{40.17} \end{equation*}(40.17)r=(1e2)a1+ecos[(1δϕ0/2π)ϕ]
where a a aaa and e e eee are constants of integration, and δ ϕ 0 δ ϕ 0 deltaphi_(0)\delta \phi_{0}δϕ0 is defined by
δ ϕ 0 = ( 2 β + 2 γ ) 3 6 π M a ( 1 e 2 ) (40.18) = 6 π M a ( 1 e 2 ) in general relativity. δ ϕ 0 = ( 2 β + 2 γ ) 3 6 π M a 1 e 2 (40.18) = 6 π M a 1 e 2  in general relativity.  {:[deltaphi_(0)=((2-beta+2gamma))/(3)(6piM_(o.))/(a(1-e^(2)))],[(40.18)=(6piM_(o.))/(a(1-e^(2)))" in general relativity. "]:}\begin{align*} \delta \phi_{0} & =\frac{(2-\beta+2 \gamma)}{3} \frac{6 \pi M_{\odot}}{a\left(1-e^{2}\right)} \\ & =\frac{6 \pi M_{\odot}}{a\left(1-e^{2}\right)} \text { in general relativity. } \tag{40.18} \end{align*}δϕ0=(2β+2γ)36πMa(1e2)(40.18)=6πMa(1e2) in general relativity. 
(For derivation, see exercise 40.4.)
Notice that, if δ ϕ 0 δ ϕ 0 deltaphi_(0)\delta \phi_{0}δϕ0 were zero-as it is in the Newtonian limit-then the orbit (40.18) would be an ellipse with semimajor axis a a aaa and eccentricity e e eee (see Box 25.4). The constant δ ϕ 0 δ ϕ 0 deltaphi_(0)\delta \phi_{0}δϕ0 merely makes the ellipse precess: for r r rrr to go through a complete circuit, from perihelion to aphelion to perihelion again, ( 1 δ ϕ 0 / 2 π ) ϕ 1 δ ϕ 0 / 2 π ϕ (1-deltaphi_(0)//2pi)phi\left(1-\delta \phi_{0} / 2 \pi\right) \phi(1δϕ0/2π)ϕ must change by 2 π 2 π 2pi2 \pi2π; so ϕ ϕ phi\phiϕ must change by 2 π + δ ϕ ˙ 0 2 π + δ ϕ ˙ 0 2pi+deltaphi^(˙)_(0)2 \pi+\delta \dot{\phi}_{0}2π+δϕ˙0. Thus, the perihelion shifts forward by an angle δ ϕ 0 δ ϕ 0 deltaphi_(0)\delta \phi_{0}δϕ0 with each circuit around the ellipse.
Relative to what does the perihelion shift? (1) Relative to the PPN coordinate system; (2) relative to inertial frames at the outskirts of the solar system (since the PPN coordinates are tied to those frames; see § 39.12 § 39.12 §39.12\S 39.12§39.12 ); (3) relative to a frame determined by the "fixed stars in the sky" (since the inertial frames at the outskirts of the solar system, inertial frames elsewhere in our galaxy, and inertial frames in our cluster of galaxies should not rotate significantly relative to each other); (4) relative to the perihelia of (other) planets, which themselves are shifting at calculable rates that decrease as one moves outward in the solar system from Mercury to Venus to Earth to....
The perihelion shift is not the only relativistic effect contained in the orbital motion for a test body. There are other effects, but they are all periodic rather than cumulative with time; so, with the limited technology of the pre-space-age era, it was impossible to detect them. But the technology of the 1970's is bringing them within reach. Moreover, many space-age experiments are necessarily of short duration ( £ £ £££ one orbit)-particularly those involving spacecraft and transponders landed on planets. For these, the periodic perturbations in an orbit are of almost as much experimental value as the cumulative perihelion shift. The periodic effects are not obvious in the PPN orbital equation (40.17); it looks like the simplest of precessing ellipses. But the quantities the observer measures directly are not a , e a , e a,ea, ea,e, and δ ϕ 0 δ ϕ 0 deltaphi_(0)\delta \phi_{0}δϕ0. Instead, he measures the time evolution of round-trip radar travel times, Δ τ ( τ ) Δ τ ( τ ) Delta tau(tau)\Delta \tau(\tau)Δτ(τ), and of angular positions on the sky [ θ 0 ( τ ) , ϕ 0 ( τ ) ] θ 0 ( τ ) , ϕ 0 ( τ ) [theta_(0)(tau),phi_(0)(tau)]\left[\theta_{0}(\tau), \phi_{0}(\tau)\right][θ0(τ),ϕ0(τ)]. To compute these quantities is perfectly straightforward in principle, but in practice is a very complex task. The calculations predict relativistic effects periodic with the frequency of the orbit and all its harmonics. The amplitudes of these effects, for the lower harmonics, must obviously be of the order of M 1 km 10 μ sec 0 .01 M 1 km 10 μ sec 0 .01 M_(o.)∼1km∼10 musec∼0^('').01M_{\odot} \sim 1 \mathrm{~km} \sim 10 \mu \mathrm{sec} \sim 0^{\prime \prime} .01M1 km10μsec0.01 arc on the sky. (The distance M = 1.48 km M = 1.48 km M_(o.)=1.48kmM_{\odot}=1.48 \mathrm{~km}M=1.48 km is the characteristic length for all relativistic effects in the sun's spherical field!)
The most favorable orbits for experimental tests of the perihelion shift and of periodic effects are those that go nearest the sun and have the highest eccentricity [see equation (40.18)]-the orbits of Mercury, Venus, Earth, Mars, and the asteroid Icarus. But how does one know that these orbits are geodesics? After all, planets are not "test bodies"; they themselves produce nonnegligible curvature in spacetime. It turns out (see § 40.9 § 40.9 §40.9\S 40.9§40.9 for full discussion) that there should exist tiny deviations from geodesic motion, but they are too small to compete with the perihelion shift or with the periodic effects discussed above, at least for these five bodies.
Extensive astronomical observations of planetary orbits, dating back to the midnineteenth century and aided by radar since 1966, have yielded accurate values for planetary perihelion shifts (accurate to ± 0.4 ± 0.4 +-0.4\pm 0.4±0.4 seconds of arc per century for Mercury). From the data, which are summarized and discussed in Box 40.3, one obtains the value
(40.19a) 1 3 ( 2 β + 2 γ ) = 1.00 { + 0.01 0.10 (40.19a) 1 3 ( 2 β + 2 γ ) = 1.00 + 0.01 0.10 {:(40.19a)(1)/(3)(2-beta+2gamma)=1.00{[+0.01],[-0.10]:}:}\frac{1}{3}(2-\beta+2 \gamma)=1.00\left\{\begin{array}{c} +0.01 \tag{40.19a}\\ -0.10 \end{array}\right.(40.19a)13(2β+2γ)=1.00{+0.010.10
for the ratio of observed relativistic shift to general relativistic prediction. Combining this result with the radar-delay value for γ γ gamma\gammaγ (40.16), one obtains a value
(40.19b) β = 1.0 { + 0.4 0.2 (40.19b) β = 1.0 + 0.4 0.2 {:(40.19b)beta=1.0{[+0.4],[-0.2]:}:}\beta=1.0\left\{\begin{array}{c} +0.4 \tag{40.19b}\\ -0.2 \end{array}\right.(40.19b)β=1.0{+0.40.2
for the PPN parameter β β beta\betaβ. (Recall: β β beta\betaβ measures the "amount of nonlinearity in the superposition law for g 00 g 00 g_(00)g_{00}g00.")
The periodic effects in the planetary orbits have not yet (1973) been studied experimentally.
The above discussion and Box 40.3 have ignored the motion of the solar system relative to the preferred frame (if one exists); i.e., they have ignored the terms (40.3')
Periodic perturbations in geodesic orbits
Comparison of theory with planetary orbits
Experimental result for β β beta\betaβ
Box 40.3 PERIHELION SHIFTS; EXPERIMENTAL RESULTS
Relativistic corrections to Newtonian theory are not the only cause of shift in the perihelion of a planetary orbit. Any departure of the Newtonian gravitational field from its idealized, spherical, inverse-square-law form also produces a shift. Such nonsphericities and resulting shifts are brought about by (1) the gravitational pulls of other planets, and (2) deformation of the sun ("solar oblateness"; "quadrupole moment"). In addition, when the primary data are optical positions of planets on the sky (right ascension and declination as functions of time), there is an apparent perihelion shift caused by the precession of the Earth's axis ("general precession"; observer not on a "stable platform"; see exercise 16.4).
The perihelion shifts due to a general precession and to the gravitational pulls of other planets can be calculated with high precision. But in 1973 there is no fully reliable way to determine the solar quadrupole moment. It is conventional to quantify the sun's quadrupole moment by a dimensionless parameter J 2 J 2 J_(2)J_{2}J2, which appears in the following expression for the Newtonian potential,
U = M r [ 1 J 2 R 2 r 2 ( 3 cos 2 θ 1 2 ) ] . U = M r 1 J 2 R 2 r 2 3 cos 2 θ 1 2 . U=(M_(o.))/(r)[1-J_(2)(R_(o.)^(2))/(r^(2))((3cos^(2)theta-1)/(2))].U=\frac{M_{\odot}}{r}\left[1-J_{2} \frac{R_{\odot}{ }^{2}}{r^{2}}\left(\frac{3 \cos ^{2} \theta-1}{2}\right)\right] .U=Mr[1J2R2r2(3cos2θ12)].
If the sun were rotating near breakup velocity, J 2 J 2 J_(2)J_{2}J2 would be near 1 . Very careful measurements of the optical shape of the sun [Dicke and Goldenberg (1967)] show a flattening, which suggests J 2 J 2 J_(2)J_{2}J2 may be near 3 × 10 5 3 × 10 5 3xx10^(-5)3 \times 10^{-5}3×105.
The total perihelion shift produced by relativity plus solar quadrupole moment is (see exercise 40.5)
δ ϕ = 6 π M a ( 1 e 2 ) λ p , λ p 2 β + 2 γ 3 + J 2 R 2 / M 2 a ( 1 e 2 ) . δ ϕ = 6 π M a 1 e 2 λ p , λ p 2 β + 2 γ 3 + J 2 R 2 / M 2 a 1 e 2 . {:[delta phi=(6piM_(o.))/(a(1-e^(2)))lambda_(p)","],[lambda_(p)-=(2-beta+2gamma)/(3)+J_(2)(R_(o.)^(2)//M_(o.))/(2a(1-e^(2))).]:}\begin{aligned} \delta \phi & =\frac{6 \pi M_{\odot}}{a\left(1-e^{2}\right)} \lambda_{p}, \\ \lambda_{p} & \equiv \frac{2-\beta+2 \gamma}{3}+J_{2} \frac{R_{\odot}{ }^{2} / M_{\odot}}{2 a\left(1-e^{2}\right)} . \end{aligned}δϕ=6πMa(1e2)λp,λp2β+2γ3+J2R2/M2a(1e2).
The Haystack radar antenna, which Irwin Shapiro and his group have used to collect extensive data on the systematics of the inner part of the solar system. Those data are rapidly becoming the most important source of information about perihelion shifts. (Picture courtesy of Lincoln Laboratories, MIT.)
Note that relativistic and quadrupole shifts have different dependences on the semimajor axis a a aaa and ecentricity e e eee of the orbit. This difference in dependence allows one to obtain values for both the quadrupole moment parameter J 2 J 2 J_(2)J_{2}J2, and the PPN parameter 1 3 ( 2 β + 2 γ ) 1 3 ( 2 β + 2 γ ) (1)/(3)(2-beta+2gamma)\frac{1}{3}(2-\beta+2 \gamma)13(2β+2γ) by combining measurements of δ ϕ δ ϕ delta phi\delta \phiδϕ for more than one planet.
The experimental results, as of 1972 , are as follows.
I. Data for Mercury from optical studies [Clemence (1943, 1947)] ^(**){ }^{*}
(general relativity with no solar oblateness predicts 43.03 / 43.03 / 43.03^('')//43.03^{\prime \prime} /43.03/ century)
Quantity Value
(a) Total observed shift per century 5599 " .74 ± 0 .41 5599 " .74 ± 0 .41 5599".74+-0^('').415599 " .74 \pm 0^{\prime \prime} .415599".74±0.41
(b) Contribution to shift caused by observer not being in an inertial frame far from
the sun ("general precession" as evaluated in 1947) 5025 .645 ± 0 .50 5025 .645 ± 0 .50 5025^('').645+-0^('').505025^{\prime \prime} .645 \pm 0^{\prime \prime} .505025.645±0.50
(c) Shift per century produced by Newtonian gravitation of other planets 531 " .54 ± 0 " .68 531 " .54 ± 0 " .68 531".54+-0".68531 " .54 \pm 0 " .68531".54±0".68
(d) Residual shift per century to be attributed to general relativity plus
solar oblateness 42 .56 ± 0 .94 42 .56 ± 0 .94 42^('').56+-0^('').9442^{\prime \prime} .56 \pm 0^{\prime \prime} .9442.56±0.94
(e) Residual shift if one uses the 1973 value for the "general precession" 41 .4 ± 0 .90 41 .4 ± 0 .90 41^('').4+-0^('').9041^{\prime \prime} .4 \pm 0^{\prime \prime} .9041.4±0.90
(f) Corresponding value of λ p λ p lambda_(p)\lambda_{p}λp (see above) λ p = 0.96 ± 0.02 λ p = 0.96 ± 0.02 lambda_(p)=0.96+-0.02\lambda_{p}=0.96 \pm 0.02λp=0.96±0.02
Quantity Value (a) Total observed shift per century 5599.74+-0^('').41 (b) Contribution to shift caused by observer not being in an inertial frame far from the sun ("general precession" as evaluated in 1947) 5025^('').645+-0^('').50 (c) Shift per century produced by Newtonian gravitation of other planets 531.54+-0.68 (d) Residual shift per century to be attributed to general relativity plus solar oblateness 42^('').56+-0^('').94 (e) Residual shift if one uses the 1973 value for the "general precession" 41^('').4+-0^('').90 (f) Corresponding value of lambda_(p) (see above) lambda_(p)=0.96+-0.02| Quantity | Value | | :--- | :--- | | (a) Total observed shift per century | $5599 " .74 \pm 0^{\prime \prime} .41$ | | (b) Contribution to shift caused by observer not being in an inertial frame far from | | | the sun ("general precession" as evaluated in 1947) | $5025^{\prime \prime} .645 \pm 0^{\prime \prime} .50$ | | (c) Shift per century produced by Newtonian gravitation of other planets | $531 " .54 \pm 0 " .68$ | | (d) Residual shift per century to be attributed to general relativity plus | | | solar oblateness | $42^{\prime \prime} .56 \pm 0^{\prime \prime} .94$ | | (e) Residual shift if one uses the 1973 value for the "general precession" | $41^{\prime \prime} .4 \pm 0^{\prime \prime} .90$ | | (f) Corresponding value of $\lambda_{p}$ (see above) | $\lambda_{p}=0.96 \pm 0.02$ |
II. 1970 Results of Shapiro (1970, 1971a,b), Shapiro et al. (1972)
(a) Values of λ p λ p lambda_(p)\lambda_{p}λp obtained by reanalyzing all the world's collection of optical data, and combining it with radar data
(b) Value of J 2 J 2 J_(2)J_{2}J2 obtained by comparing the observed shifts for Mercury and Mars
{ ( λ p ) Mercury = 1.00 ± 0.01 ( λ p ) Mars = 1.07 ± 0.10 λ p Mercury  = 1.00 ± 0.01 λ p Mars  = 1.07 ± 0.10 {[(lambda_(p))_("Mercury ")=1.00+-0.01],[(lambda_(p))_("Mars ")=1.07+-0.10]:}\left\{\begin{array}{l}\left(\lambda_{p}\right)_{\text {Mercury }}=1.00 \pm 0.01 \\ \left(\lambda_{p}\right)_{\text {Mars }}=1.07 \pm 0.10\end{array}\right.{(λp)Mercury =1.00±0.01(λp)Mars =1.07±0.10
J 2 3 × 10 5 J 2 3 × 10 5 J_(2) <= 3xx10^(-5)J_{2} \leqq 3 \times 10^{-5}J23×105
III. Theoretical implications of Shapiro's results
(a) Value of ( 2 β + 2 γ ) / 3 ( 2 β + 2 γ ) / 3 (2-beta+2gamma)//3(2-\beta+2 \gamma) / 3(2β+2γ)/3
1.00 { + 0.01 0.10 1.00 + 0.01 0.10 1.00{[+0.01],[-0.10]:}1.00\left\{\begin{array}{l} +0.01 \\ -0.10 \end{array}\right.1.00{+0.010.10
(b) Value of β β beta\betaβ obtained by combining with γ γ gamma\gammaγ from time delay experiments [equation (40.16)]
1.0 { + 0.4 0.2 1.0 + 0.4 0.2 1.0{[+0.4],[-0.2]:}1.0\left\{\begin{array}{l}+0.4 \\ -0.2\end{array}\right.1.0{+0.40.2
*Clemence (1947) notes, "The observations cannot be made in a Newtonian frame of reference. They are referred to the moving equinox, that is, they are affected by the precession of the equinoxes, and the determination of the precessional motion is one of the most difficult problems of observational astronomy, if not the most difficult. In the light of all these hazards, it is not surprising that a difference of opinion could exist regarding the closeness of agreement between the observed and theoretical motions."
in the sun's metric. When one takes account of these terms, one finds an additional contribution to the perihelion shift, given for small eccentricities e 1 e 1 e≪1e \ll 1e1 by
(40.20) δ ϕ 0 = α 1 π 2 e ( M a ) 1 / 2 w Q α 2 π 4 [ ( w P ) 2 ( w Q ) 2 ] + α 3 π e ( | Ω | M ) ( ω a 2 M ) w Q (40.20) δ ϕ 0 = α 1 π 2 e M a 1 / 2 w Q α 2 π 4 ( w P ) 2 ( w Q ) 2 + α 3 π e Ω M ω a 2 M w Q {:[(40.20)deltaphi_(0)=-alpha_(1)(pi)/(2e)((M_(o.))/(a))^(1//2)w*Q-alpha_(2)(pi)/(4)[(w*P)^(2)-(w*Q)^(2)]],[+alpha_(3)(pi )/(e)((|Omega_(o.)|)/(M_(o.)))((omega_(o.)a^(2))/(M_(o.)))w*Q]:}\begin{align*} \delta \phi_{0}= & -\alpha_{1} \frac{\pi}{2 e}\left(\frac{M_{\odot}}{a}\right)^{1 / 2} w \cdot \boldsymbol{Q}-\alpha_{2} \frac{\pi}{4}\left[(\boldsymbol{w} \cdot \boldsymbol{P})^{2}-(\boldsymbol{w} \cdot \boldsymbol{Q})^{2}\right] \tag{40.20}\\ & +\alpha_{3} \frac{\pi}{e}\left(\frac{\left|\Omega_{\odot}\right|}{M_{\odot}}\right)\left(\frac{\omega_{\odot} a^{2}}{M_{\odot}}\right) \boldsymbol{w} \cdot \boldsymbol{Q} \end{align*}(40.20)δϕ0=α1π2e(Ma)1/2wQα2π4[(wP)2(wQ)2]+α3πe(|Ω|M)(ωa2M)wQ
[see Nordtvedt and Will (1972)]. Here M , Ω M , Ω M_(o.),Omega_(o.)M_{\odot}, \Omega_{\odot}M,Ω, and ω ω omega_(o.)\omega_{\odot}ω are the sun's mass, self-gravitational energy, and rotational angular velocity; w w w\boldsymbol{w}w is the sun's velocity relative to the preferred frame; a a aaa and e e eee are the semimajor axis and eccentricity of the orbit; P P P\boldsymbol{P}P is the unit vector pointing from the sun to the perihelion; and Q Q Q\boldsymbol{Q}Q is a unit vector orthogonal to P P P\boldsymbol{P}P and lying in the orbital plane. Comparison with observations for
Perihelion shift due to preferred-frame forces
Mercury-and combination with limits on α 1 α 1 alpha_(1)\alpha_{1}α1 and α 2 α 2 alpha_(2)\alpha_{2}α2 discussed below [equations (40.46b) and (40.48)]-yields the limit
(40.21a) | α 3 w Q 200 km / sec | 2 × 10 5 . (40.21a) α 3 w Q 200 km / sec 2 × 10 5 . {:(40.21a)|alpha_(3)(w*Q)/(200(km)//sec)| <= 2xx10^(-5).:}\begin{equation*} \left|\alpha_{3} \frac{w \cdot \boldsymbol{Q}}{200 \mathrm{~km} / \mathrm{sec}}\right| \leqq 2 \times 10^{-5} . \tag{40.21a} \end{equation*}(40.21a)|α3wQ200 km/sec|2×105.
Since the velocity of the sun around the Galaxy is 200 km / sec 200 km / sec ∼200km//sec\sim 200 \mathrm{~km} / \mathrm{sec}200 km/sec, and the peculiar motion of the Galaxy relative to other nearby galaxies is 200 km / sec 200 km / sec ∼200km//sec\sim 200 \mathrm{~km} / \mathrm{sec}200 km/sec, a value w 200 km / sec w 200 km / sec w∼200km//sec\boldsymbol{w} \sim 200 \mathrm{~km} / \mathrm{sec}w200 km/sec is reasonable. Moreover, there is no reason to believe that w w w\boldsymbol{w}w and Q Q Q\boldsymbol{Q}Q are orthogonal, so one is fairly safe in concluding
(40.21b) | α 3 | = | 4 β 1 2 γ 2 ζ | 2 × 10 5 (40.21b) α 3 = 4 β 1 2 γ 2 ζ 2 × 10 5 {:(40.21b)|alpha_(3)|=|4beta_(1)-2gamma-2-zeta|≲2xx10^(-5):}\begin{equation*} \left|\alpha_{3}\right|=\left|4 \beta_{1}-2 \gamma-2-\zeta\right| \lesssim 2 \times 10^{-5} \tag{40.21b} \end{equation*}(40.21b)|α3|=|4β12γ2ζ|2×105
This is a stringent limit on theories that possess universal rest frames. For example, with great certainty it rules out a theory devised by Coleman (1971), which has β = γ = 1 β = γ = 1 beta=gamma=1\beta=\gamma=1β=γ=1, but α 3 = 4 α 3 = 4 alpha_(3)=-4\alpha_{3}=-4α3=4; see Ni (1972).
Looking toward the future, one cannot expect data on orbits of spacecraft to give decisive tests of general relativity, despite the high precision ( 10 10 ∼10\sim 1010 meters in 1972) with which spacecraft can be tracked. Spacecraft are buffeted by the solar wind. They respond to fluctuations in this wind and in the pressure of solar radiation, and respond also to "outgassing" from leaky jets. Unless one can develop excellent "drag-free" or "conscience-guided" spacecraft, one must therefore continue to rely on planets as the source of data on geodesics. However, planetary data themselves can be greatly improved in the future by placing radar transponders on the surfaces of planets or in orbit about them, by improvements in radar technology, and by the continued accumulation of more and more observations.

EXERCISES

Exercise 40.4. DERIVATION OF PERIHELION SHIFT IN PPN FORMALISM

[See exercise 25.16 for a derivation in general relativity, accurate when gravity is strong ( 2 M / r 2 M / r 2M//r2 M / r2M/r as large as 1 3 1 3 (1)/(3)\frac{1}{3}13 ) but the orbital eccentricity is small. The present exercise applies to any "metric theory" and to any eccentricity, but it assumes gravity is weak ( 2 M / r 1 2 M / r 1 2M//r≪12 M / r \ll 12M/r1 ) and ignores motion relative to any universal rest frame.] Derive equation (40.17) for the shape of any bound orbit of a test particle moving in the equatorial plane of the PPN gravitational field (40.3). Keep only "first-order" corrections beyond Newtonian theory (first order in powers of M / r M / r M_(o.)//rM_{\odot} / rM/r ). [Sketch of solution using Hamilton-Jacobi theory (Box 25.4): (1) Hamilton-Jacobi equation, referred to a test body of unit mass, is
1 = g α β S ~ , α S ~ , β = [ 1 + 2 M r + ( 4 2 β ) ( M r ) 2 ] ( S ~ t ) 2 + [ 1 2 γ M r ] [ ( S ~ r ) 2 + 1 r 2 ( S ~ ϕ ) 2 ] 1 = g α β S ~ , α S ~ , β = 1 + 2 M r + ( 4 2 β ) M r 2 S ~ t 2 + 1 2 γ M r S ~ r 2 + 1 r 2 S ~ ϕ 2 {:[-1=g^(alpha beta) widetilde(S)_(,alpha) widetilde(S)_(,beta)],[=-[1+2(M_(o.))/(r)+(4-2beta)((M_(o.))/(r))^(2)]((del( widetilde(S)))/(del t))^(2)+[1-2gamma(M_(o.))/(r)][((del( widetilde(S)))/(del r))^(2)+(1)/(r^(2))((del( widetilde(S)))/(del phi))^(2)]]:}\begin{aligned} -1 & =g^{\alpha \beta} \widetilde{S}_{, \alpha} \widetilde{S}_{, \beta} \\ & =-\left[1+2 \frac{M_{\odot}}{r}+(4-2 \beta)\left(\frac{M_{\odot}}{r}\right)^{2}\right]\left(\frac{\partial \widetilde{S}}{\partial t}\right)^{2}+\left[1-2 \gamma \frac{M_{\odot}}{r}\right]\left[\left(\frac{\partial \widetilde{S}}{\partial r}\right)^{2}+\frac{1}{r^{2}}\left(\frac{\partial \widetilde{S}}{\partial \phi}\right)^{2}\right] \end{aligned}1=gαβS~,αS~,β=[1+2Mr+(42β)(Mr)2](S~t)2+[12γMr][(S~r)2+1r2(S~ϕ)2]
(2) Solution to Hamilton-Jacobi equation is
(40.22) S ~ = E ~ t + L ~ ϕ ± r { ( 1 E ~ 2 ) + 2 M r [ 1 ( 1 + γ ) ( 1 E ~ 2 ) ] L ~ 2 r 2 [ 1 2 M 2 L ~ 2 ( 2 β + 2 γ ) ] } 1 / 2 d r (40.22) S ~ = E ~ t + L ~ ϕ ± r 1 E ~ 2 + 2 M r 1 ( 1 + γ ) 1 E ~ 2 L ~ 2 r 2 1 2 M 2 L ~ 2 ( 2 β + 2 γ ) 1 / 2 d r {:[(40.22) widetilde(S)=- widetilde(E)t+ widetilde(L)phi+-int^(r){-(1- widetilde(E)^(2))+(2M_(o.))/(r)[1-(1+gamma)(1- widetilde(E)^(2))]:}],[-( widetilde(L)^(2))/(r^(2))[1-(2M_(o.)^(2))/( widetilde(L)^(2))(2-beta+2gamma)]}^(1//2)dr]:}\begin{align*} \widetilde{S}= & -\widetilde{E} t+\widetilde{L} \phi \pm \int^{r}\left\{-\left(1-\widetilde{E}^{2}\right)+\frac{2 M_{\odot}}{r}\left[1-(1+\gamma)\left(1-\widetilde{E}^{2}\right)\right]\right. \tag{40.22}\\ & \left.-\frac{\widetilde{L}^{2}}{r^{2}}\left[1-\frac{2 M_{\odot}{ }^{2}}{\widetilde{L}^{2}}(2-\beta+2 \gamma)\right]\right\}^{1 / 2} d r \end{align*}(40.22)S~=E~t+L~ϕ±r{(1E~2)+2Mr[1(1+γ)(1E~2)]L~2r2[12M2L~2(2β+2γ)]}1/2dr
where post-post-Newtonian corrections have been discarded. In discarding post-post-Newtonian corrections, recall that E ~ E ~ widetilde(E)\widetilde{E}E~ is the conserved energy per unit rest mass and L ~ L ~ widetilde(L)\widetilde{L}L~ is the angular momentum per unit rest mass (see Box 25.4). Consequently one has the order-of-magnitude relations
1 E ~ 2 ( velocity of test body ) 2 M / r 1 E ~ 2 (  velocity of test body  ) 2 M / r 1- widetilde(E)^(2)∼(" velocity of test body ")^(2)∼M_(o.)//r1-\widetilde{E}^{2} \sim(\text { velocity of test body })^{2} \sim M_{\odot} / r1E~2( velocity of test body )2M/r
and
( M / L ~ ) 2 ( M / r v ) 2 M / r M / L ~ 2 M / r v 2 M / r (M_(o.)//( widetilde(L)))^(2)∼(M_(o.)//rv)^(2)∼M_(o.)//r\left(M_{\odot} / \widetilde{L}\right)^{2} \sim\left(M_{\odot} / r v\right)^{2} \sim M_{\odot} / r(M/L~)2(M/rv)2M/r
(3) The shape of the orbit is determined by the "condition of constructive interference," S ~ / L ~ = 0 : S ~ / L ~ = 0 : del widetilde(S)//del widetilde(L)=0:\partial \widetilde{S} / \partial \widetilde{L}=0:S~/L~=0:
ϕ = ± { 1 E ~ 2 L ~ 2 + 2 M L ~ 2 r [ 1 ( 1 + γ ) ( 1 E ~ 2 ) ] 1 r 2 [ 1 2 M 2 L ~ 2 ( 2 β + 2 γ ) ] } 1 / 2 d ( 1 / r ) ϕ = ± { 1 E ~ 2 L ~ 2 + 2 M L ~ 2 r 1 ( 1 + γ ) 1 E ~ 2 1 r 2 1 2 M 2 L ~ 2 ( 2 β + 2 γ ) 1 / 2 d ( 1 / r ) {:[phi=+-int{-(1- widetilde(E)^(2))/( widetilde(L)^(2))+(2M_(o.))/( widetilde(L)^(2)r)[1-(1+gamma)(1- widetilde(E)^(2))]],[-(1)/(r^(2))[1-(2M_(o.)^(2))/( widetilde(L)^(2))(2-beta+2gamma)]}^(-1//2)d(1//r)]:}\begin{aligned} \phi= \pm \int\{ & -\frac{1-\widetilde{E}^{2}}{\widetilde{L}^{2}}+\frac{2 M_{\odot}}{\widetilde{L}^{2} r}\left[1-(1+\gamma)\left(1-\widetilde{E}^{2}\right)\right] \\ & \left.-\frac{1}{r^{2}}\left[1-\frac{2 M_{\odot}^{2}}{\widetilde{L}^{2}}(2-\beta+2 \gamma)\right]\right\}^{-1 / 2} d(1 / r) \end{aligned}ϕ=±{1E~2L~2+2ML~2r[1(1+γ)(1E~2)]1r2[12M2L~2(2β+2γ)]}1/2d(1/r)
(4) This integral is readily evaluated in terms of trigonometric functions. For a bound orbit ( E ~ < 1 E ~ < 1 widetilde(E) < 1\widetilde{E}<1E~<1 ), the integral is
ϕ = ( 1 + δ ϕ 0 2 π ) cos 1 [ ( 1 e 2 ) a e r 1 e ] ϕ = 1 + δ ϕ 0 2 π cos 1 1 e 2 a e r 1 e phi=(1+(deltaphi_(0))/(2pi))cos^(-1)[((1-e^(2))a)/(er)-(1)/(e)]\phi=\left(1+\frac{\delta \phi_{0}}{2 \pi}\right) \cos ^{-1}\left[\frac{\left(1-e^{2}\right) a}{e r}-\frac{1}{e}\right]ϕ=(1+δϕ02π)cos1[(1e2)aer1e]
where
a M 1 E ~ 2 [ 1 ( 1 + γ ) ( 1 E ~ 2 ) ] (40.23) 1 e 2 ( L ~ M ) 2 ( 1 E ~ 2 ) [ 1 + 2 ( 1 + γ ) ( 1 E ~ 2 ) 2 ( M L ~ ) 2 ( 2 β + 2 γ ) ] δ ϕ 0 1 3 ( 2 β + 2 γ ) 6 π ( M / L ~ ) 2 a M 1 E ~ 2 1 ( 1 + γ ) 1 E ~ 2 (40.23) 1 e 2 L ~ M 2 1 E ~ 2 1 + 2 ( 1 + γ ) 1 E ~ 2 2 M L ~ 2 ( 2 β + 2 γ ) δ ϕ 0 1 3 ( 2 β + 2 γ ) 6 π M / L ~ 2 {:[a-=(M_(o.))/(1- widetilde(E)^(2))[1-(1+gamma)(1- widetilde(E)^(2))]],[(40.23)1-e^(2)-=((( widetilde(L)))/(M_(o.)))^(2)(1- widetilde(E)^(2))[1+2(1+gamma)(1- widetilde(E)^(2))-2((M_(o.))/(( widetilde(L))))^(2)(2-beta+2gamma)]],[deltaphi_(0)-=(1)/(3)(2-beta+2gamma)6pi(M_(o.)//( widetilde(L)))^(2)]:}\begin{gather*} a \equiv \frac{M_{\odot}}{1-\widetilde{E}^{2}}\left[1-(1+\gamma)\left(1-\widetilde{E}^{2}\right)\right] \\ 1-e^{2} \equiv\left(\frac{\widetilde{L}}{M_{\odot}}\right)^{2}\left(1-\widetilde{E}^{2}\right)\left[1+2(1+\gamma)\left(1-\widetilde{E}^{2}\right)-2\left(\frac{M_{\odot}}{\widetilde{L}}\right)^{2}(2-\beta+2 \gamma)\right] \tag{40.23}\\ \delta \phi_{0} \equiv \frac{1}{3}(2-\beta+2 \gamma) 6 \pi\left(M_{\odot} / \widetilde{L}\right)^{2} \end{gather*}aM1E~2[1(1+γ)(1E~2)](40.23)1e2(L~M)2(1E~2)[1+2(1+γ)(1E~2)2(ML~)2(2β+2γ)]δϕ013(2β+2γ)6π(M/L~)2
(5) Straightforward manipulations bring this result into the form of equations (40.17) and (40.18).]

Exercise 40.5. PERIHELION SHIFT FOR OBLATE SUN

(a) The Newtonian potential for an oblate sun has the form
(40.24) U = M r ( 1 J 2 R 2 r 2 3 cos 2 θ 1 2 ) (40.24) U = M r 1 J 2 R 2 r 2 3 cos 2 θ 1 2 {:(40.24)U=(M_(o.))/(r)(1-J_(2)(R_(o.)^(2))/(r^(2))(3cos^(2)theta-1)/(2)):}\begin{equation*} U=\frac{M_{\odot}}{r}\left(1-J_{2} \frac{R_{\odot}{ }^{2}}{r^{2}} \frac{3 \cos ^{2} \theta-1}{2}\right) \tag{40.24} \end{equation*}(40.24)U=Mr(1J2R2r23cos2θ12)
where J 2 J 2 J_(2)J_{2}J2 is the "quadrupole-moment parameter." One knows that J 2 3 × 10 5 J 2 3 × 10 5 J_(2) <= 3xx10^(-5)J_{2} \leqq 3 \times 10^{-5}J23×105. Show that if an oblate sun is at rest at the origin of the PPN coordinate system, the metric of the surrounding spacetime [equations (39.32)] can be put into the form
d s 2 = [ 1 2 M r 2 J 2 ( M R 2 r 3 ) ( 3 cos 2 θ 1 2 ) + 2 β ( M r ) 2 ] d t 2 (40.25) + [ 1 + 2 γ M r ] [ d r 2 + r 2 ( d θ 2 + sin 2 θ d ϕ 2 ) ] d s 2 = 1 2 M r 2 J 2 M R 2 r 3 3 cos 2 θ 1 2 + 2 β M r 2 d t 2 (40.25) + 1 + 2 γ M r d r 2 + r 2 d θ 2 + sin 2 θ d ϕ 2 {:[ds^(2)=-[1-2(M_(o.))/(r)-2J_(2)((M_(o.)R_(o.)^(2))/(r^(3)))((3cos^(2)theta-1)/(2))+2beta((M_(o.))/(r))^(2)]dt^(2)],[(40.25)+[1+2gamma(M_(o.))/(r)][dr^(2)+r^(2)(dtheta^(2)+sin^(2)theta dphi^(2))]]:}\begin{align*} d s^{2}= & -\left[1-2 \frac{M_{\odot}}{r}-2 J_{2}\left(\frac{M_{\odot} R_{\odot}{ }^{2}}{r^{3}}\right)\left(\frac{3 \cos ^{2} \theta-1}{2}\right)+2 \beta\left(\frac{M_{\odot}}{r}\right)^{2}\right] d t^{2} \\ & +\left[1+2 \gamma \frac{M_{\odot}}{r}\right]\left[d r^{2}+r^{2}\left(d \theta^{2}+\sin ^{2} \theta d \phi^{2}\right)\right] \tag{40.25} \end{align*}ds2=[12Mr2J2(MR2r3)(3cos2θ12)+2β(Mr)2]dt2(40.25)+[1+2γMr][dr2+r2(dθ2+sin2θdϕ2)]
  • corrections of post-post-Newtonian magnitude.
    (b) Let a test particle move in a bound orbit in the equatorial plane. Use Hamilton-Jacobi theory to show that its orbit is a precessing ellipse [equation (40.17)] with a precession per orbit given by
(40.26) δ ϕ ˙ 0 = 2 β + 2 γ 3 6 π M a ( 1 e 2 ) + J 2 3 π R 2 a 2 ( 1 e 2 ) 2 (40.26) δ ϕ ˙ 0 = 2 β + 2 γ 3 6 π M a 1 e 2 + J 2 3 π R 2 a 2 1 e 2 2 {:(40.26)deltaphi^(˙)_(0)=(2-beta+2gamma)/(3)(6piM_(o.))/(a(1-e^(2)))+J_(2)(3piR_(o.)^(2))/(a^(2)(1-e^(2))^(2)):}\begin{equation*} \delta \dot{\phi}_{0}=\frac{2-\beta+2 \gamma}{3} \frac{6 \pi M_{\odot}}{a\left(1-e^{2}\right)}+J_{2} \frac{3 \pi R_{\odot}^{2}}{a^{2}\left(1-e^{2}\right)^{2}} \tag{40.26} \end{equation*}(40.26)δϕ˙0=2β+2γ36πMa(1e2)+J23πR2a2(1e2)2
For the significance of this result, see Box 40.3.
The rest of this chapter is Track 2 . No earlier Track-2 material is needed as preparation for it, but the following will be helpful:
(1) Chapter 6 (accelerated observers);
(2) §17.6 (no prior geometry); and
(3) Chapters 38 and 39 (tests of foundations; other theories; PPN formalism). It is not needed as preparation for any later chapter.
3-body effects in lunar orbit:
(1) theory
(2) prospects for measurement

§40.6. THREE-BODY EFFECTS IN THE LUNAR ORBIT

The relativistic effects discussed thus far all involve the spherical part of the sun's external gravitational field, and thus they can probe only the PPN parameters β β beta\betaβ and γ γ gamma\gammaγ plus the "preferred-frame" parameters α 1 , α 2 α 1 , α 2 alpha_(1),alpha_(2)\alpha_{1}, \alpha_{2}α1,α2, and α 3 α 3 alpha_(3)\alpha_{3}α3. Attempts to measure other PPN parameters can focus on three-body interactions (discussed here), the dragging of inertial frames by a rotating body ( § 40.7 § 40.7 §40.7\S 40.7§40.7 ), anomalies in the locally measured gravitational constant ( $ 40.8 $ 40.8 $40.8\$ 40.8$40.8 ), and deviations of planetary and lunar orbits from geodesics ( $ 40.9 $ 40.9 $40.9\$ 40.9$40.9 ).
There is no better place to study three-body interactions than the Earth-moon orbit. The pulls of the Earth, the moon, and the sun all contribute. Perturbations in the motion of Earth and moon about their common center of gravity can be measured with high precision using laser ranging (earth-moon separation measured to 10 cm 10 cm ∼10cm\sim 10 \mathrm{~cm}10 cm in early 1970 's) and using a radio beacon on the moon's surface (angular position on sky potentially measurable to better than 0 .001 0 .001 0^('').0010^{\prime \prime} .0010.001 of arc).
Over and above any Newtonian three-body interactions, the Earth and the sun, acting together in a nonlinear manner, should produce relativistic perturbations in the lunar orbit that are barely within the range of this technology. These effects depend on the familiar parameters γ γ gamma\gammaγ (measuring space curvature) and β β beta\betaβ [measuring amount of nonlinear superposition, ( U Earth + U sun ) 2 U Earth  + U sun  2 (U_("Earth ")+U_("sun "))^(2)\left(U_{\text {Earth }}+U_{\text {sun }}\right)^{2}(UEarth +Usun )2, in g 00 ] g 00 {:g_(00)]\left.g_{00}\right]g00]. In addition, they depend on β 2 β 2 beta_(2)\beta_{2}β2, which regulates the extent to which the sun's potential, U sun U sun  U_("sun ")U_{\text {sun }}Usun , acting inside the Earth, affects the strength of the Earth's gravitational pull, causing it to vary as the Earth moves nearer and farther from the sun. These effects are expected to depend also on ζ , Δ 1 ζ , Δ 1 zeta,Delta_(1)\zeta, \Delta_{1}ζ,Δ1, and Δ 2 Δ 2 Delta_(2)\Delta_{2}Δ2, which regulate the extent to which the Earth's orbital momentum and anisotropies in kinetic energy (caused by the sun) gravitate.
Bromberg (1958), Baierlein (1967), and Krogh and Baierlein (1968) have calculated the three dominant three-body effects in the Earth-moon orbit using general relativity and the Dicke-Brans-Jordan theory. These effects are noncumulative and have amplitudes of 100 cm , 20 cm 100 cm , 20 cm ∼100cm,∼20cm\sim 100 \mathrm{~cm}, \sim 20 \mathrm{~cm}100 cm,20 cm, and 10 cm 10 cm ∼10cm\sim 10 \mathrm{~cm}10 cm. The 100 cm 100 cm 100-cm100-\mathrm{cm}100cm effect [which was originally discovered by de Sitter (1916)] is known to depend only on γ γ gamma\gammaγ. The precise dependence of the other effects on the PPN parameters is not known.
The prospects for measuring these effects in the 1970's are dim; they are masked by peculiarities in the orbit of the moon that have nothing to do with relativity.

§40.7. THE DRAGGING OF INERTIAL FRAMES

The experiments discussed thus far study the motion of electromagnetic waves, spacecraft, planets, and asteroids through the solar system. An entirely different type of experiment measures changes in the orientation of a gyroscope moving in the gravitational field of the Earth. This experiment is particularly important because it can measure directly the "dragging of inertial frames" by the angular momentum of the Earth.
It is useful, before specializing to a rotating Earth, to derive a general expression for the precession of a gyroscope in the post-Newtonian limit. (Track-1 readers, and others who have not studied Chapters 6 and 39, may have difficulty following the derivation. No matter. It is the answer that counts!)
Let S α S α S^(alpha)S^{\alpha}Sα be the spin of the gyroscope (i.e., its angular momentum vector), and let u α u α u^(alpha)u^{\alpha}uα be its 4 -velocity. The spin is always orthogonal to the 4 -velocity, S α u α = 0 S α u α = 0 S^(alpha)u_(alpha)=0S^{\alpha} u_{\alpha}=0Sαuα=0 (see Box 5.6). Assume that any nongravitational forces acting on the gyroscope are applied at its center of mass, so that there is no torque in its proper reference frame. Then the gyroscope will "Fermi-Walker transport" its spin along its world line (see §6.5):
(40.27) u S = u ( a S ) , a u u = 4-acceleration. (40.27) u S = u ( a S ) , a u u =  4-acceleration.  {:(40.27)grad_(u)S=u(a*S)","quad a-=grad_(u)u=" 4-acceleration. ":}\begin{equation*} \boldsymbol{\nabla}_{u} \boldsymbol{S}=\boldsymbol{u}(\boldsymbol{a} \cdot \boldsymbol{S}), \quad \boldsymbol{a} \equiv \boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{u}=\text { 4-acceleration. } \tag{40.27} \end{equation*}(40.27)uS=u(aS),auu= 4-acceleration. 
The objective of the calculation is to write down and analyze this transport equation in the post-Newtonian limit.
The gyroscope moves relative to the PPN coordinate grid with a velocity v j v j v_(j)-=v_{j} \equivvj d x j / d t d x j / d t d x j / d t d x j / d t dx^(j)//dt-=dx_(j)//dtd x^{j} / d t \equiv d x_{j} / d tdxj/dtdxj/dt. Assume that v j ϵ v j ϵ v_(j) <= epsilonv_{j} \leq \epsilonvjϵ, where ϵ ϵ epsilon\epsilonϵ is the post-Newtonian expansion parameter ( ϵ 2 M / R ϵ 2 M / R epsilon^(2)~~M_(o.)//R_(o.)\epsilon^{2} \approx M_{\odot} / R_{\odot}ϵ2M/R ). As the gyroscope moves, it carries with itself an orthonormal frame e α ^ e α ^ e_( hat(alpha))\boldsymbol{e}_{\hat{\alpha}}eα^, which is related to the PPN coordinate frame by a pure Lorentz boost, plus a renormalization of the lengths of the basis vectors [transformation (39.41)]. The spin is a purely spatial vector ( S 0 = 0 ) S 0 = 0 (S^(0)=0)\left(S^{0}=0\right)(S0=0) in this comoving frame; its length ( S j S j ) 1 / 2 S j S j 1 / 2 (S_(j)S_(j))^(1//2)\left(S_{j} S_{j}\right)^{1 / 2}(SjSj)1/2 remains fixed (conservation of angular momentum); and its direction is regulated by the Fermi-Walker transport law.
The basis vectors e α ^ e α ^ e_( hat(alpha))\boldsymbol{e}_{\hat{\alpha}}eα^ of the comoving frame are not Fermi-Walker transported, by contrast with the spin. Rather, they are tied by a pure boost (no rotation!) to the PPN coordinate grid, which in turn is tied to an inertial frame far from the solar system, which in turn one expects to be fixed relative to the "distant stars." Thus, by calculating the precession of the spin relative to the comoving frame,
(40.28) d S j / d τ ε j k ^ ı Ω k ^ S ı , (40.28) d S j / d τ ε j k ^ ı Ω k ^ S ı , {:(40.28)dS_(j)//d tau-=epsi_(j hat(k)ı)Omega_( hat(k))S_(ı)",":}\begin{equation*} d S_{j} / d \tau \equiv \varepsilon_{j \hat{k} \imath} \Omega_{\hat{k}} S_{\imath}, \tag{40.28} \end{equation*}(40.28)dSj/dτεjk^ıΩk^Sı,
one is in effect evaluating the spin's angular velocity of precession, Ω j Ω j Omega_(j)\Omega_{\mathfrak{j}}Ωj, relative to a frame fixed on the sky by the distant stars.
Calculate d S j / d τ d S j / d τ dS_(j)//d taud S_{j} / d \taudSj/dτ :
(40.29) d S j ^ d τ = u ( S e j ) = ( u S ) e j + s ( u e j ) = s u e j ^ . (40.29) d S j ^ d τ = u S e j = u S e j + s u e j = s u e j ^ . {:(40.29)(dS_( hat(j)))/(d tau)=grad_(u)(S*e_(j))=(grad_(u)S)*e_(j)+s*(grad_(u)e_(j))=s*grad_(u)e_( hat(j)).:}\begin{equation*} \frac{d S_{\hat{j}}}{d \tau}=\boldsymbol{\nabla}_{u}\left(\boldsymbol{S} \cdot \boldsymbol{e}_{j}\right)=\left(\boldsymbol{\nabla}_{u} \boldsymbol{S}\right) \cdot \boldsymbol{e}_{j}+\boldsymbol{s} \cdot\left(\boldsymbol{\nabla}_{u} \boldsymbol{e}_{j}\right)=\boldsymbol{s} \cdot \boldsymbol{\nabla}_{u} \boldsymbol{e}_{\hat{j}} . \tag{40.29} \end{equation*}(40.29)dSj^dτ=u(Sej)=(uS)ej+s(uej)=suej^.
Here use is made of the fact that u S u S grad_(u)S\boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{S}uS is in the u u u\boldsymbol{u}u direction [equation (40.27)] and
Gyroscope precession:
(1) general analysis
is thus orthogonal to e j ^ e j ^ e_( hat(j))\boldsymbol{e}_{\hat{j}}ej^. The quantity S u e j ^ S u e j ^ S*grad_(u)e_( hat(j))\boldsymbol{S} \cdot \boldsymbol{\nabla}_{\boldsymbol{u}} \boldsymbol{e}_{\hat{j}}Suej^ is readily evaluated in the PPN coordinate frame. In the evaluation, one uses as metric coefficients [equations (39.32)] the expressions
g 00 = 1 + 2 U + O ( ϵ 4 ) , g j k = δ j k ( 1 + 2 γ U ) + O ( ϵ 4 ) , (40.30) g 0 j = 7 2 Δ 1 V j 1 2 Δ 2 W j + ( "preferred- frame terms" ) + O ( ϵ 5 ) ; g 00 = 1 + 2 U + O ϵ 4 , g j k = δ j k ( 1 + 2 γ U ) + O ϵ 4 , (40.30) g 0 j = 7 2 Δ 1 V j 1 2 Δ 2 W j + (  "preferred-   frame terms"  ) + O ϵ 5 ; {:[g_(00)=-1+2U+O(epsilon^(4))","quadg_(jk)=delta_(jk)(1+2gamma U)+O(epsilon^(4))","],[(40.30)g_(0j)=-(7)/(2)Delta_(1)V_(j)-(1)/(2)Delta_(2)W_(j)+((" "preferred- ")/(" frame terms" "))+O(epsilon^(5));]:}\begin{align*} & g_{00}=-1+2 U+O\left(\epsilon^{4}\right), \quad g_{j k}=\delta_{j k}(1+2 \gamma U)+O\left(\epsilon^{4}\right), \\ & g_{0 j}=-\frac{7}{2} \Delta_{1} V_{j}-\frac{1}{2} \Delta_{2} W_{j}+\binom{\text { "preferred- }}{\text { frame terms" }}+O\left(\epsilon^{5}\right) ; \tag{40.30} \end{align*}g00=1+2U+O(ϵ4),gjk=δjk(1+2γU)+O(ϵ4),(40.30)g0j=72Δ1Vj12Δ2Wj+( "preferred-  frame terms" )+O(ϵ5);
one takes as the components of e j ^ e j ^ e_( hat(j))\boldsymbol{e}_{\hat{j}}ej^ and S S S\boldsymbol{S}S [obtained via the transformation (39.41)] the expressions
e j 0 = v j + O ( ϵ 3 ) , e j k = ( 1 γ U ) δ j k + 1 2 v k v j + O ( ϵ 4 ) , (40.31) S 0 = v j S j ^ + O ( ϵ 3 S j ) , S k = ( 1 γ U ) S k ^ + 1 2 v k v j S j ^ + O ( ϵ 4 S j ) ; e j 0 = v j + O ϵ 3 , e j k = ( 1 γ U ) δ j k + 1 2 v k v j + O ϵ 4 , (40.31) S 0 = v j S j ^ + O ϵ 3 S j , S k = ( 1 γ U ) S k ^ + 1 2 v k v j S j ^ + O ϵ 4 S j ; {:[e_(j)^(0)=v_(j)+O(epsilon^(3))","quade_(j)^(k)=(1-gamma U)delta_(jk)+(1)/(2)v_(k)v_(j)+O(epsilon^(4))","],[(40.31)S^(0)=v_(j)S_( hat(j))+O(epsilon^(3)S_(j))","],[S^(k)=(1-gamma U)S_( hat(k))+(1)/(2)v_(k)v_(j)S_( hat(j))+O(epsilon^(4)S_(j));]:}\begin{gather*} e_{j}^{0}=v_{j}+O\left(\epsilon^{3}\right), \quad e_{j}^{k}=(1-\gamma U) \delta_{j k}+\frac{1}{2} v_{k} v_{j}+O\left(\epsilon^{4}\right), \\ S^{0}=v_{j} S_{\hat{j}}+O\left(\epsilon^{3} S_{j}\right), \tag{40.31}\\ S^{k}=(1-\gamma U) S_{\hat{k}}+\frac{1}{2} v_{k} v_{j} S_{\hat{j}}+O\left(\epsilon^{4} S_{j}\right) ; \end{gather*}ej0=vj+O(ϵ3),ejk=(1γU)δjk+12vkvj+O(ϵ4),(40.31)S0=vjSj^+O(ϵ3Sj),Sk=(1γU)Sk^+12vkvjSj^+O(ϵ4Sj);
and one uses the relation
(40.32) d v j / d τ = a j + U , j + O ( ϵ 2 U , j ) (40.32) d v j / d τ = a j + U , j + O ϵ 2 U , j {:(40.32)dv_(j)//d tau=a_(j)+U_(,j)+O(epsilon^(2)U_(,j)):}\begin{equation*} d v_{j} / d \tau=a_{j}+U_{, j}+O\left(\epsilon^{2} U_{, j}\right) \tag{40.32} \end{equation*}(40.32)dvj/dτ=aj+U,j+O(ϵ2U,j)
where a j a j a_(j)a_{j}aj (assumed U , j U , j <= U_(,j)\leqq U_{, j}U,j ) are the components of the 4 -acceleration. One finds (see exercise 40.6 ) for the precession of the spin the result
d S j / d τ = S u e j = S k [ v [ j a k ] + g 0 [ k , j ] ( 2 γ + 1 ) v [ j U , k ] ] . d S j / d τ = S u e j = S k v [ j a k ] + g 0 [ k , j ] ( 2 γ + 1 ) v [ j U , k ] . dS_(j)//d tau=S*grad_(u)e_(j)=S_(k)[v_([j)a_(k])+g_(0[k,j])-(2gamma+1)v_([j)U_(,k])].d S_{j} / d \tau=\boldsymbol{S} \cdot \boldsymbol{\nabla}_{u} \boldsymbol{e}_{j}=S_{k}\left[v_{[j} a_{k]}+g_{0[k, j]}-(2 \gamma+1) v_{[j} U_{, k]}\right] .dSj/dτ=Suej=Sk[v[jak]+g0[k,j](2γ+1)v[jU,k]].
Rewritten in three-dimensional vector form this result becomes
(2) general PPN formula for precession
(3) specialization: Thomas precession
(40.33a) d S / d τ = Ω × S (40.33b) Ω 1 2 v × a 1 2 × g + ( γ + 1 2 ) v × U (40.33c) g g 0 j e j ^ . (40.33a) d S / d τ = Ω × S (40.33b) Ω 1 2 v × a 1 2 × g + γ + 1 2 v × U (40.33c) g g 0 j e j ^ . {:[(40.33a)dS//d tau=Omega xx S],[(40.33b)Omega-=-(1)/(2)v xx a-(1)/(2)grad xx g+(gamma+(1)/(2))v xx grad U],[(40.33c)g-=g_(0j)e_( hat(j)).]:}\begin{gather*} d \boldsymbol{S} / d \tau=\boldsymbol{\Omega} \times \boldsymbol{S} \tag{40.33a}\\ \boldsymbol{\Omega} \equiv-\frac{1}{2} v \times a-\frac{1}{2} \nabla \times \boldsymbol{g}+\left(\gamma+\frac{1}{2}\right) v \times \nabla U \tag{40.33b}\\ \boldsymbol{g} \equiv g_{0 j} e_{\hat{j}} . \tag{40.33c} \end{gather*}(40.33a)dS/dτ=Ω×S(40.33b)Ω12v×a12×g+(γ+12)v×U(40.33c)gg0jej^.
In this final answer it does not matter whether the 3 -vectors entering into Ω Ω Omega\boldsymbol{\Omega}Ω are evaluated in the coordinate frame or in the comoving orthonormal frame, since e 3 e 3 e_(3)\boldsymbol{e}_{3}e3 and / x j / x j del//delx_(j)\partial / \partial x_{j}/xj differ only by corrections of order ϵ 2 ϵ 2 epsilon^(2)\epsilon^{2}ϵ2.
Equations (40.33) describe in complete generality at the post-Newtonian level of approximation the precession of the gyroscope spin S S S\boldsymbol{S}S relative to the comoving orthonormal frame that is rotationally tied to the distant stars.
For an electron with spin S S S\boldsymbol{S}S in orbit around a proton, only the first term, 1 2 ν × a 1 2 ν × a -(1)/(2)nu xx a-\frac{1}{2} \boldsymbol{\nu} \times \boldsymbol{a}12ν×a, is present (no gravity). This term leads to the Thomas precession, which plays an important role in the fine structure of atomic spectra [see, e.g., Ruark and Urey (1930)]. For other ways of deriving the Thomas precession, see exercise 6.9 and $ 41.4 $ 41.4 $41.4\$ 41.4$41.4.
The Thomas precession comes into play for a gyroscope on the surface of the Earth ( a = a = a=a=a= Newtonian acceleration of gravity), but not for a gyroscope in a freely moving satellite.
If one ignores the rotation of the Earth and preferred-frame effects, and puts the PPN coordinate frame at rest relative to the center of the Earth, then g 0 j g 0 j g_(0j)g_{0 j}g0j vanishes and Ω Ω Omega\boldsymbol{\Omega}Ω is given by
Ω = ν × [ 1 2 a + ( γ + 1 2 ) U ] (40.34) = γ ν × U for gyroscope on Earth's surface = ( γ + 1 2 ) ν × U for gyroscope in orbit. Ω = ν × 1 2 a + γ + 1 2 U (40.34) = γ ν × U  for gyroscope on Earth's surface  = γ + 1 2 ν × U  for gyroscope in orbit.  {:[Omega=nu xx[-(1)/(2)a+(gamma+(1)/(2))grad U]],[(40.34)=gamma nu xx grad U" for gyroscope on Earth's surface "],[=(gamma+(1)/(2))nu xx grad U" for gyroscope in orbit. "]:}\begin{align*} \boldsymbol{\Omega} & =\boldsymbol{\nu} \times\left[-\frac{1}{2} \boldsymbol{a}+\left(\gamma+\frac{1}{2}\right) \nabla U\right] \\ & =\gamma \boldsymbol{\nu} \times \nabla U \text { for gyroscope on Earth's surface } \tag{40.34}\\ & =\left(\gamma+\frac{1}{2}\right) \boldsymbol{\nu} \times \boldsymbol{\nabla} U \text { for gyroscope in orbit. } \end{align*}Ω=ν×[12a+(γ+12)U](40.34)=γν×U for gyroscope on Earth's surface =(γ+12)ν×U for gyroscope in orbit. 
The general-relativistic term ( γ + 1 2 ) v × U γ + 1 2 v × U (gamma+(1)/(2))v xx grad U\left(\gamma+\frac{1}{2}\right) \boldsymbol{v} \times \boldsymbol{\nabla} U(γ+12)v×U is caused by the motion of the gyroscope through the Earth's curved, static spacetime geometry. Notice that it depends solely on the same parameter γ γ gamma\gammaγ as is tested by electromagnetic-wave experiments. In order of magnitude, for a gyroscope in a near-Earth, polar orbit,
(40.35) Ω 3 2 ( M E R E ) 1 / 2 ( M E R E 2 ) 8 seconds of arc per year. (40.35) Ω 3 2 M E R E 1 / 2 M E R E 2 8  seconds of arc per year.  {:(40.35)Omega~~(3)/(2)((M_(E))/(R_(E)))^(1//2)((M_(E))/(R_(E)^(2)))~~8" seconds of arc per year. ":}\begin{equation*} \Omega \approx \frac{3}{2}\left(\frac{M_{E}}{R_{E}}\right)^{1 / 2}\left(\frac{M_{E}}{R_{E}^{2}}\right) \approx 8 \text { seconds of arc per year. } \tag{40.35} \end{equation*}(40.35)Ω32(MERE)1/2(MERE2)8 seconds of arc per year. 
The general-relativistic precession 3 2 v × U 3 2 v × U (3)/(2)v xx grad U\frac{3}{2} v \times \nabla U32v×U was derived by W. de Sitter (1916) for the "Earth-moon gyroscope" orbiting the sun. Eleven years later L. H. Thomas (1927) derived the special relativistic precession 1 2 v × a 1 2 v × a -(1)/(2)v xx a-\frac{1}{2} \boldsymbol{v} \times \boldsymbol{a}12v×a for application to atomic physics.
The Earth's rotation produces off-diagonal terms, g 0 j g 0 j g_(0j)g_{0 j}g0j, in the PPN metric (exercise 40.7):
(40.36) g = g 0 j e j = ( 7 4 Δ 1 + 1 4 Δ 2 ) J × r r 3 (40.36) g = g 0 j e j = 7 4 Δ 1 + 1 4 Δ 2 J × r r 3 {:(40.36)g=g_(0j)e_(j)=-((7)/(4)Delta_(1)+(1)/(4)Delta_(2))(J xx r)/(r^(3)):}\begin{equation*} g=g_{0 j} e_{j}=-\left(\frac{7}{4} \Delta_{1}+\frac{1}{4} \Delta_{2}\right) \frac{\boldsymbol{J} \times \boldsymbol{r}}{r^{3}} \tag{40.36} \end{equation*}(40.36)g=g0jej=(74Δ1+14Δ2)J×rr3
Here J J J\boldsymbol{J}J is the Earth's angular momentum. These off-diagonal terms contribute an amount
(40.37) Ω = 1 2 × g = ( 7 8 Δ 1 + 1 8 Δ 2 ) 1 r 3 [ J + 3 ( J r ) r r 2 ] (40.37) Ω = 1 2 × g = 7 8 Δ 1 + 1 8 Δ 2 1 r 3 J + 3 ( J r ) r r 2 {:(40.37)Omega=-(1)/(2)grad xx g=((7)/(8)Delta_(1)+(1)/(8)Delta_(2))(1)/(r^(3))[-J+(3(J*r)r)/(r^(2))]:}\begin{equation*} \Omega=-\frac{1}{2} \nabla \times g=\left(\frac{7}{8} \Delta_{1}+\frac{1}{8} \Delta_{2}\right) \frac{1}{r^{3}}\left[-J+\frac{3(J \cdot r) r}{r^{2}}\right] \tag{40.37} \end{equation*}(40.37)Ω=12×g=(78Δ1+18Δ2)1r3[J+3(Jr)rr2]
to the precession of the gyroscope. Notice that this contribution, unlike the others, is independent of the linear velocity of the gyroscope. One can think of it in the following way.
The gyroscope is rotationally at rest relative to the inertial frames in its neighborhood. It and the local inertial frames rotate relative to the distant galaxies with the angular velocity Ω Ω Omega\boldsymbol{\Omega}Ω because the Earth's rotation "drags" the local inertial frames along with it. Notice that near the north and south poles the local inertial frames rotate in the same direction as the Earth does ( Ω Ω Omega\boldsymbol{\Omega}Ω parallel to J J J\boldsymbol{J}J ), but near the equator they rotate in the opposite direction ( Ω Ω Omega\boldsymbol{\Omega}Ω antiparallel to J J J\boldsymbol{J}J; compare Ω Ω Omega\boldsymbol{\Omega}Ω with the magnetic field of the Earth!). Although this might seem paradoxical at first, an analogy devised by Schiff makes it seem more reasonable.* Consider a rotating, solid sphere immersed in a viscous fluid. As it rotates, the sphere will drag the fluid along with it. At various points in the fluid, set down little rods, and watch how the fluid
(4) specialization: precessions due to acceleration and Earth's Newtonian potential
(5) specialization: precession due to Earth's rotation
rotates them as it flows past. Near the poles the fluid will clearly rotate the rods in the same direction as the star rotates. But near the equator, because the fluid is dragged more rapidly at small radii than at large, the end of a rod closest to the sphere is dragged by the fluid more rapidly than the far end of the rod. Consequently, the rod rotates in the direction opposite to the rotation of the sphere.
In order of magnitude, the precessional angular velocity caused by the Earth's rotation is
( ) Ω J E / R E 3 0.1 seconds of arc per year. ( ) Ω J E / R E 3 0.1  seconds of arc per year.  {:('")"Omega∼J_(E)//R_(E)^(3)∼0.1" seconds of arc per year. ":}\begin{equation*} \Omega \sim J_{E} / R_{E}^{3} \sim 0.1 \text { seconds of arc per year. } \tag{$\prime$} \end{equation*}()ΩJE/RE30.1 seconds of arc per year. 
(6) prospects for measuring precession
Both this precession, and the larger one [equation (40.35)] due to motion through the Earth's static field, may be detectable in the 1970's. Equipment aimed at detecting them via a satellite experiment is now (1973) under construction at Stanford University; see Everitt, Fairbank, and Hamilton (1970); also O'Connell (1972).*
The gyroscope precession produced by motion of the Earth relative to the preferred frame (if any) is too small to be of much interest.

EXERCISES

Exercise 40.6. PRECESSIONAL ANGULAR VELOCITY

Derive equations (40.33) for the precession of a gyroscope in the post-Newtonian limit. Base the derivation on equations (40.29)-(40.32).

Exercise 40.7. OFF-DIAGONAL TERMS IN METRIC ABOUT THE EARTH

Idealize the Earth as an isolated, rigidly rotating sphere with angular momentum J J JJJ. Use equations ( 39.34 b , c 39.34 b , c 39.34b,c39.34 \mathrm{~b}, \mathrm{c}39.34 b,c ) and (39.27) to show that (in three-dimensional vector notation)
(40.38) V V j e j = W W j e j = 1 2 J × r / r 3 (40.38) V V j e j = W W j e j = 1 2 J × r / r 3 {:(40.38)V-=V_(j)e_(j)=W-=W_(j)e_(j)=(1)/(2)J xx r//r^(3):}\begin{equation*} \boldsymbol{V} \equiv V_{j} \boldsymbol{e}_{j}=\boldsymbol{W} \equiv W_{j} \boldsymbol{e}_{j}=\frac{1}{2} \boldsymbol{J} \times \boldsymbol{r} / r^{3} \tag{40.38} \end{equation*}(40.38)VVjej=WWjej=12J×r/r3
outside the Earth, in the Earth's PPN rest frame. From this, infer equation (40.36).

Exercise 40.8. SPIN-CURVATURE COUPLING

Consider a spinning body (e.g., the Earth or a gyroscope or an electron) moving through curved spacetime. Tidal gravitational forces produced by the curvature of spacetime act on the elementary pieces of the spinning body. These forces should depend not only on the positions of the pieces relative to the center of the object, but also on their relative velocities. Moreover, the spin of the body,
S ( ρ r × v ) d ( volume ) in comoving orthonormal frame S ( ρ r × v ) d (  volume  )  in comoving orthonormal frame  S-=int(rho r xx v)d(" volume ")quad" in comoving orthonormal frame "\boldsymbol{S} \equiv \int(\rho \boldsymbol{r} \times \boldsymbol{v}) d(\text { volume }) \quad \text { in comoving orthonormal frame }S(ρr×v)d( volume ) in comoving orthonormal frame 
is a measure of the relative positions and velocities of its pieces. Therefore one expects the spin to couple to the tidal gravitational forces-i.e., to the curvature of spacetime-producing
deviations from geodesic motion. Careful solution of the PPN equations of Chapter 39 for general relativity reveals [Papapetrou (1951), Pirani (1956)] that such coupling occurs and causes a deviation of the worldline from the course that it would otherwise take; thus,
(40.39) m D u α d τ = S μ u ν D 2 u β d τ 2 ϵ α μ ν β + 1 2 ( ϵ λ μ ρ τ R α ν λ μ ) u ν S ρ u τ (40.39) m D u α d τ = S μ u ν D 2 u β d τ 2 ϵ α μ ν β + 1 2 ϵ λ μ ρ τ R α ν λ μ u ν S ρ u τ {:(40.39)m(Du^(alpha))/(d tau)=-S_(mu)u_(nu)(D^(2)u_(beta))/(dtau^(2))epsilon^(alpha mu nu beta)+(1)/(2)(epsilon^(lambda mu rho tau)R^(alpha nu)_(lambda mu))u_(nu)S_(rho)u_(tau):}\begin{equation*} m \frac{D u^{\alpha}}{d \tau}=-S_{\mu} u_{\nu} \frac{D^{2} u_{\beta}}{d \tau^{2}} \epsilon^{\alpha \mu \nu \beta}+\frac{1}{2}\left(\epsilon^{\lambda \mu \rho \tau} R^{\alpha \nu}{ }_{\lambda \mu}\right) u_{\nu} S_{\rho} u_{\tau} \tag{40.39} \end{equation*}(40.39)mDuαdτ=SμuνD2uβdτ2ϵαμνβ+12(ϵλμρτRανλμ)uνSρuτ
Evaluate, in order of magnitude, the effects of the supplementary term on planetary orbits in the solar system.
[Answer: They are much too small to be detected. However, there are two other material places to look for the effect: (1) when a rapidly spinning neutron star, or a black hole endowed with substantial angular momentum enters the powerful tidal field of another neutron stur or black hole; and (2) when an individual electron, or the totality of electrons in the "Dirac sea of negative energy states," enter a still more powerful tidal field (late phase of gravitational collapse). Such a tidal field, or curvature, pulls oppositely on electrons with the two opposite directions of spin [Pirani (1956); DeWitt (1962), p. 338; Schwinger (1963a,b)] just as an electric field pulls oppositely on electrons with the two opposite signs of charge ["vacuum polarization"; see especially Heisenberg and Euler (1936)]. In principle, the tidal field pulling on the spin of an electron need not be due to "background" spacetime curvature; it might be due to a nearby massive spinning object, such as a "live" black hole (chapter 33) ["gravitational spin-spin coupling"; O’Connell (1972)].

§40.8. IS THE GRAVITATIONAL CONSTANT CONSTANT?

The title and subject of this section are likely to arouse confusion. Throughout this book one has used geometrized units, in which G = c = 1 G = c = 1 G=c=1G=c=1G=c=1. Therefore, one has locked oneself into a viewpoint that forbids asking whether the gravitational constant changes from event to event.
False! One can perfectly well ask the question in the context of G = c = 1 G = c = 1 G=c=1G=c=1G=c=1, if one makes clear what is meant by the question.
In § § 1.5 § § 1.5 §§1.5\S \S 1.5§§1.5 and 1.6 , c 1.6 , c 1.6,c1.6, c1.6,c was defined to be a certain conversion factor between centimeters and seconds; and G / c 2 G / c 2 G//c^(2)G / c^{2}G/c2 was defined to be a certain conversion factor between grams and centimeters. These definitions by fiat do not guarantee, however, that a Cavendish experiment* to measure the attraction between two bodies will yield
Force = G m 1 m 2 / r 2 = m 1 m 2 / r 2  Force  = G m 1 m 2 / r 2 = m 1 m 2 / r 2 " Force "=-Gm_(1)m_(2)//r^(2)=-m_(1)m_(2)//r^(2)\text { Force }=-G m_{1} m_{2} / r^{2}=-m_{1} m_{2} / r^{2} Force =Gm1m2/r2=m1m2/r2
If general relativity correctly describes classical gravity, and if the values of the conversion factors G G GGG and c c ccc have been chosen precisely right, then any Cavendish experiment, anywhere in the universe, will yield "Force = m 1 m 2 / r 2 = m 1 m 2 / r 2 =-m_(1)m_(2)//r^(2)=-m_{1} m_{2} / r^{2}=m1m2/r2 ". But if the
"Cavendish gravitational constant," G C G C G_(C)G_{\mathrm{C}}GC, defined
Changes of G C G C G_(C)G_{C}GC with time
Spatial variations in G C G C G_(C)G_{C}GC
Dicke-Brans-Jordan theory, or almost any other metric theory gives the correct description of gravity, the force in the Cavendish experiment will depend on where and when the experiment is performed, as well as on m 1 , m 2 m 1 , m 2 m_(1),m_(2)m_{1}, m_{2}m1,m2, and r r rrr. To discuss Cavendish experiments as tests of gravitation theory, then, one must introduce a new proportionality factor
(40.40) G C G Cavendish ("Cavendish gravitational constant"), (40.40) G C G Cavendish   ("Cavendish gravitational constant"),  {:(40.40)G_(C)-=G_("Cavendish ")-=" ("Cavendish gravitational constant"), ":}\begin{equation*} G_{\mathrm{C}} \equiv G_{\text {Cavendish }} \equiv \text { ("Cavendish gravitational constant"), } \tag{40.40} \end{equation*}(40.40)GCGCavendish  ("Cavendish gravitational constant"), 
which enters into the Newtonian force law
(40.41) Force = G C m 1 m 2 / r 2 (40.41)  Force  = G C m 1 m 2 / r 2 {:(40.41)" Force "=-G_(C)m_(1)m_(2)//r^(2):}\begin{equation*} \text { Force }=-G_{\mathrm{C}} m_{1} m_{2} / r^{2} \tag{40.41} \end{equation*}(40.41) Force =GCm1m2/r2
This Cavendish constant will be unity in general relativity, but in most other metric theories it will vary from event to event in spacetime.
In some theories, such as Dicke-Brans-Jordan, the Cavendish constant is determined by the distribution of matter in the universe. As a result, the expansion of the universe changes its value:
1 G C d G C d t ( 0.1 to 1 age of universe ) 1 10 10 or 10 11 years 1 G C d G C d t 0.1  to  1  age of universe  1 10 10  or  10 11  years  (1)/(G_(C))(dG_(C))/(dt)∼-((0.1" to "1)/(" age of universe "))∼(-1)/(10^(10)" or "10^(11)" years ")\frac{1}{G_{\mathrm{C}}} \frac{d G_{\mathrm{C}}}{d t} \sim-\left(\frac{0.1 \text { to } 1}{\text { age of universe }}\right) \sim \frac{-1}{10^{10} \text { or } 10^{11} \text { years }}1GCdGCdt(0.1 to 1 age of universe )11010 or 1011 years 
[see, e.g. Brans and Dicke (1961)]. A variety of observations place limits on such time variations. Big time changes in G C G C G_(C)G_{\mathrm{C}}GC during the last 4.6 billion years would have produced marked effects on the Earth, the sun, and the entire solar system. The expected geophysical effects have been summarized and compared with observations by Dicke and Peebles (1965). It is hard to draw firm conclusions because of the complexity of the geophysics involved, but a fairly certain limit is
(40.42a) ( 1 / G C ) ( d G C / d t ) 1 / 10 10 years (geophysical). (40.42a) 1 / G C d G C / d t 1 / 10 10  years   (geophysical).  {:(40.42a)(1//G_(C))(dG_(C)//dt) <= 1//10^(10)" years "quad" (geophysical). ":}\begin{equation*} \left(1 / G_{\mathrm{C}}\right)\left(d G_{\mathrm{C}} / d t\right) \leqq 1 / 10^{10} \text { years } \quad \text { (geophysical). } \tag{40.42a} \end{equation*}(40.42a)(1/GC)(dGC/dt)1/1010 years  (geophysical). 
Eventually, high-precision measurements of the orbital motions of planets will yield a better limit. For the present, planetary observations show
(40.42b) ( 1 / G C ) ( d G O / d t ) 4 / 10 10 years (planetary orbits) (40.42b) 1 / G C d G O / d t 4 / 10 10  years   (planetary orbits)  {:(40.42b)(1//G_(C))(dG_(O)//dt) <= 4//10^(10)" years "quad" (planetary orbits) ":}\begin{equation*} \left(1 / G_{\mathrm{C}}\right)\left(d G_{\mathrm{O}} / d t\right) \leqq 4 / 10^{10} \text { years } \quad \text { (planetary orbits) } \tag{40.42b} \end{equation*}(40.42b)(1/GC)(dGO/dt)4/1010 years  (planetary orbits) 
[Shapiro, Smith, et al.(1971)]. These limits are tight enough to begin to be interesting, but not yet tight enough to disprove any otherwise viable theories of gravity.
If G C G C G_(C)G_{\mathrm{C}}GC is determined by the distribution of matter in the universe, then it should depend on where in the universe one is, as well as when. In particular, as one moves from point to point in the solar system, closer to the Sun and then farther away, one should see G C G C G_(C)G_{\mathrm{C}}GC change. Indeed this is the case in most metric theories of gravity, though not in general relativity. Analyses of Cavendish experiments using the PPN formalism reveal spatial variation in G C G C G_(C)G_{\mathrm{C}}GC given by
(40.43) Δ G C = 2 G C ( β + γ β 2 1 ) U (40.43) Δ G C = 2 G C β + γ β 2 1 U {:(40.43)DeltaG_(C)=-2G_(C)(beta+gamma-beta_(2)-1)U:}\begin{equation*} \Delta G_{\mathrm{C}}=-2 G_{\mathrm{C}}\left(\beta+\gamma-\beta_{2}-1\right) U \tag{40.43} \end{equation*}(40.43)ΔGC=2GC(β+γβ21)U
[Nordtvedt (1970, 1971a); Will (1971b)].
The amplitude of these variations along the Earth's elliptical orbit is Δ G C / G C Δ G C / G C DeltaG_(C)//G_(C)∼\Delta G_{\mathrm{C}} / G_{\mathrm{C}} \simΔGC/GC 10 10 10 10 10^(-10)10^{-10}1010, if β + γ β 2 1 1 β + γ β 2 1 1 beta+gamma-beta_(2)-1∼1\beta+\gamma-\beta_{2}-1 \sim 1β+γβ211. This is far too small to measure directly in the
1970's. Despite great ingenuity and effort, the most accurate experiments measuring the value of G C G C G_(C)G_{\mathrm{C}}GC have precisions in 1972 no better than 1 part in 10 4 10 4 10^(4)10^{4}104 [see Beams (1971)]. Experiments to search for yearly variations in G C G C G_(C)G_{\mathrm{C}}GC on Earth without measuring the actual value ("null-type experiments") can surely be performed with better precision than 1 in 10 4 10 4 10^(4)10^{4}104-but not with precisions approaching 1 in 10 10 10 10 10^(10)10^{10}1010. On the other hand, indirect consequences of a spatial variation of G C G C G_(C)G_{\mathrm{C}}GC in the solar system are almost certainly measurable (see § 40.9 § 40.9 §40.9\S 40.9§40.9 below).
In Ni's theory of gravity (Box 39.1), and other two-tensor or vector-tensor theories like it, where the prior geometry picks out a preferred "universal rest frame," the Cavendish constant G C G C G_(C)G_{\mathrm{C}}GC can depend on velocity relative to the preferred frame. For Cavendish experiments with two equal masses separated by distances large compared to their sizes, G C G C G_(C)G_{\mathrm{C}}GC varies as
(40.44) Δ G C = G C [ 1 2 ( α 2 + α 3 α 1 ) v 2 1 2 α 2 ( v n ) 2 ] (40.44) Δ G C = G C 1 2 α 2 + α 3 α 1 v 2 1 2 α 2 ( v n ) 2 {:(40.44)DeltaG_(C)=G_(C)[(1)/(2)(alpha_(2)+alpha_(3)-alpha_(1))v^(2)-(1)/(2)alpha_(2)(v*n)^(2)]:}\begin{equation*} \Delta G_{\mathrm{C}}=G_{\mathrm{C}}\left[\frac{1}{2}\left(\alpha_{2}+\alpha_{3}-\alpha_{1}\right) v^{2}-\frac{1}{2} \alpha_{2}(\boldsymbol{v} \cdot \boldsymbol{n})^{2}\right] \tag{40.44} \end{equation*}(40.44)ΔGC=GC[12(α2+α3α1)v212α2(vn)2]
[Will (1971b)]. Here v v v\boldsymbol{v}v is the velocity of the Cavendish apparatus relative to the preferred frame, and n n n\boldsymbol{n}n is the unit vector between the two masses. For experiments where one body is a massive sphere (e.g., the Earth), and the other is a small object on the sphere's surface, G C G C G_(C)G_{\mathrm{C}}GC varies as
Δ G C / G C = 1 2 [ ( α 3 α 1 ) + α 2 ( 1 I / M R 2 ) ] v 2 ( ) 1 2 α 2 ( 1 3 I / M R 2 ) ( v n ) 2 Δ G C / G C = 1 2 α 3 α 1 + α 2 1 I / M R 2 v 2 ( ) 1 2 α 2 1 3 I / M R 2 ( v n ) 2 {:[DeltaG_(C)//G_(C)=(1)/(2)[(alpha_(3)-alpha_(1))+alpha_(2)(1-I//MR^(2))]v^(2)],[('")"-(1)/(2)alpha_(2)(1-3I//MR^(2))(v*n)^(2)]:}\begin{align*} \Delta G_{\mathrm{C}} / G_{\mathrm{C}}= & \frac{1}{2}\left[\left(\alpha_{3}-\alpha_{1}\right)+\alpha_{2}\left(1-I / M R^{2}\right)\right] \boldsymbol{v}^{2} \\ & -\frac{1}{2} \alpha_{2}\left(1-3 I / M R^{2}\right)(\boldsymbol{v} \cdot \boldsymbol{n})^{2} \tag{$\prime$} \end{align*}ΔGC/GC=12[(α3α1)+α2(1I/MR2)]v2()12α2(13I/MR2)(vn)2
[Nordtvedt and Will (1972)]. Here M M MMM and R R RRR are the mass and radius of the sphere, and
I = ( ρ r 2 ) 4 π r 2 d r I = ρ r 2 4 π r 2 d r I=int(rhor^(2))4pir^(2)drI=\int\left(\rho r^{2}\right) 4 \pi r^{2} d rI=(ρr2)4πr2dr
is the trace of the second moment of its mass distribution. Consequences of these effects for planetary orbits have not yet been spelled out, but consequences for Earthbound experiments have.
Think of a Cavendish experiment in which one mass is the Earth, and the other is a gravimeter on the Earth's surface. The gravimeter gives a reading for the "local acceleration of gravity,"
(40.45) g = G C m Earth / r Earth 2 (40.45) g = G C m Earth  / r Earth  2 {:(40.45)g=G_(C)m_("Earth ")//r_("Earth ")^(2):}\begin{equation*} g=G_{\mathrm{C}} m_{\text {Earth }} / r_{\text {Earth }}{ }^{2} \tag{40.45} \end{equation*}(40.45)g=GCmEarth /rEarth 2
As the Earth turns, so the unit vector n n n\boldsymbol{n}n between its center and the gravimeter rotates, G C G C G_(C)G_{\mathrm{C}}GC and hence g g ggg will fluctuate with a period of 12 sidereal hours and an amplitude
( Δ g / g ) amplitude = 1 4 α 2 v 2 cos 2 θ m ( Δ g / g ) amplitude  = 1 4 α 2 v 2 cos 2 θ m (Delta g//g)_("amplitude ")=(1)/(4)alpha_(2)v^(2)cos^(2)theta_(m)(\Delta g / g)_{\text {amplitude }}=\frac{1}{4} \alpha_{2} v^{2} \cos ^{2} \theta_{m}(Δg/g)amplitude =14α2v2cos2θm
Here θ m θ m theta_(m)\theta_{m}θm is the minimum, as the Earth rotates, of the angle between v v v\boldsymbol{v}v (constant vector) and n n n\boldsymbol{n}n (rotating vector). (Note: we have used the value I / M R 2 0.5 I / M R 2 0.5 I//MR^(2)≃0.5I / M R^{2} \simeq 0.5I/MR20.5 for the Earth.) These fluctuations will produce tides in the Earth of the same type as are
Dependence of G C G C G_(C)G_{C}GC on velocity
Anomalies in Earth tides due to anisotropies in G C G C G_(C)G_{C}GC :

(1) experimental value of α 2 α 2 alpha_(2)\alpha_{2}α2
(2) experimental disproof of Whitehead theory
Anomalies in Earth rotation rate due to dependence of G C G C G_(C)G_{C}GC on velocity
produced by the moon and sun. As of 1972, gravimeter measurements near the Earth's equator show no sign of any anomalous 12 -sidereal-hour effects down to an amplitude of 10 9 10 9 ∼10^(-9)\sim 10^{-9}109 [Will (1971b)]. Consequently,
(40.46a) | α 2 | 1 / 2 v cos θ m = | Δ 2 + ζ 1 | 1 / 2 v cos θ m 6 × 10 5 20 km / sec (40.46a) α 2 1 / 2 v cos θ m = Δ 2 + ζ 1 1 / 2 v cos θ m 6 × 10 5 20 km / sec {:(40.46a)|alpha_(2)|^(1//2)v cos theta_(m)=|Delta_(2)+zeta-1|^(1//2)v cos theta_(m) <= 6xx10^(-5)∼20km//sec:}\begin{equation*} \left|\alpha_{2}\right|^{1 / 2} v \cos \theta_{m}=\left|\Delta_{2}+\zeta-1\right|^{1 / 2} v \cos \theta_{m} \leqq 6 \times 10^{-5} \sim 20 \mathrm{~km} / \mathrm{sec} \tag{40.46a} \end{equation*}(40.46a)|α2|1/2vcosθm=|Δ2+ζ1|1/2vcosθm6×10520 km/sec
Using a rough estimate of v 200 km / sec v 200 km / sec v∼200km//secv \sim 200 \mathrm{~km} / \mathrm{sec}v200 km/sec for the Earth's velocity relative to the universal rest frame, and θ m 60 θ m 60 theta_(m) <= 60^(@)\theta_{m} \leq 60^{\circ}θm60 for the angle between v v vvv and the Earth's equatorial plane, one obtains the rough limit
(40.46b) | α 2 | = | Δ 2 + ζ 1 | 0.03 (40.46b) α 2 = Δ 2 + ζ 1 0.03 {:(40.46b)|alpha_(2)|=|Delta_(2)+zeta-1| <= 0.03:}\begin{equation*} \left|\alpha_{2}\right|=\left|\Delta_{2}+\zeta-1\right| \leqq 0.03 \tag{40.46b} \end{equation*}(40.46b)|α2|=|Δ2+ζ1|0.03
[This limit does not affect the three theories in Box 39.1; of them, only Ni's theory has prior geometry and a universal rest frame; and it predicts isotropic effects in Δ G C / G C Δ G C / G C DeltaG_(C)//G_(C)\Delta G_{\mathrm{C}} / G_{\mathrm{C}}ΔGC/GC [equation (40.44)], but no anisotropic effects. However, other theories with universal rest frames-e.g. Papapetrou's (1954a,b,c) theory-are ruled out by this limit; see Ni (1972), Nordtvedt and Will (1972).]
Whitehead's theory of gravity (which is a two-tensor theory with a rather different type of prior geometry from Ni's) predicts that the galaxy should produce velocityindependent anisotropies in G C G C G_(C)G_{\mathrm{C}}GC. These, in turn, would produce Earth tides with periods of 12 sidereal hours and amplitudes of
Δ g / g 2 × 10 7 100 × ( experimental limit on such amplitudes ) Δ g / g 2 × 10 7 100 × (  experimental limit on   such amplitudes  ) Delta g//g∼2xx10^(-7)∼100 xx((" experimental limit on ")/(" such amplitudes "))\Delta g / g \sim 2 \times 10^{-7} \sim 100 \times\binom{\text { experimental limit on }}{\text { such amplitudes }}Δg/g2×107100×( experimental limit on  such amplitudes )
[Will (1971b)]. The absence of such tides proves Whitehead's theory to be incor-rect-a feat of disproof beyond the power of all redshift, light-deflection, time-delay, and perihelion-shift measurements. (For all these "standard experiments," the predictions of Whitehead and Einstein are identical!)
Equation ( 40.44 ) 40.44 (40.44^('))\left(40.44^{\prime}\right)(40.44) predicts a periodic annual variation of the Cavendish constant on Earth, as the Earth moves around the sun:
v = ( velocity of Earth relative to sun ) + ( velocity of sun relative to preferred frame ) v E + w ; (40.47) ( Δ G C / G C ) averaged over Surface of Earth = 1 2 ( 2 3 α 2 + α 3 α 1 ) ( w 2 + v E 2 + 2 w v E ) v = (  velocity of Earth   relative to sun  ) + (  velocity of sun relative   to preferred frame  ) v E + w ; (40.47) Δ G C / G C  averaged over   Surface of Earth  = 1 2 2 3 α 2 + α 3 α 1 w 2 + v E 2 + 2 w v E {:[v=((" velocity of Earth ")/(" relative to sun "))+((" velocity of sun relative ")/(" to preferred frame "))-=v_(E)+w;],[(40.47)(DeltaG_(C)//G_(C))_({:[" averaged over "],[" Surface of Earth "]:})=(1)/(2)((2)/(3)alpha_(2)+alpha_(3)-alpha_(1))(w^(2)+v_(E)^(2)+2w*v_(E))]:}\begin{gather*} v=\binom{\text { velocity of Earth }}{\text { relative to sun }}+\binom{\text { velocity of sun relative }}{\text { to preferred frame }} \equiv v_{E}+w ; \\ \left(\Delta G_{\mathrm{C}} / G_{\mathrm{C}}\right)_{\substack{\text { averaged over } \\ \text { Surface of Earth }}}=\frac{1}{2}\left(\frac{2}{3} \alpha_{2}+\alpha_{3}-\alpha_{1}\right)\left(w^{2}+v_{E}^{2}+2 w \cdot v_{E}\right) \tag{40.47} \end{gather*}v=( velocity of Earth  relative to sun )+( velocity of sun relative  to preferred frame )vE+w;(40.47)(ΔGC/GC) averaged over  Surface of Earth =12(23α2+α3α1)(w2+vE2+2wvE)
This annual variation, assuming all PPN parameters are of order unity, is 1,000 times larger than the one produced by the Earth's motion in and out through the sun's gravitational potential [equation (40.43)]. In response to this changing Cavendish constant, the Earth's self-gravitational pull should change, and the Earth should "breathe" inward (greater pull) and outward (relaxed pull). The resulting annual variations in the Earth's moment of inertia should produce annual changes in its rotation rate ω ω omega\omegaω (changes in "length of day" as measure by atomic clocks):
δ ω / ω 0.1 ( 2 3 α 2 + α 3 α 1 ) w v E δ ω / ω 0.1 2 3 α 2 + α 3 α 1 w v E delta omega//omega∼0.1((2)/(3)alpha_(2)+alpha_(3)-alpha_(1))w*v_(E)\delta \omega / \omega \sim 0.1\left(\frac{2}{3} \alpha_{2}+\alpha_{3}-\alpha_{1}\right) w \cdot v_{E}δω/ω0.1(23α2+α3α1)wvE
[Nordtvedt and Will (1972)]. Comparison with the measured annual variations of rotation rate (all of which geophysicists attribute to seasonal changes in the Earth's atmosphere) yields the following limit
(40.48) | 2 3 α 2 + α 3 α 1 | =≦ 0.2 (40.48) 2 3 α 2 + α 3 α 1 =≦ 0.2 {:(40.48)|(2)/(3)alpha_(2)+alpha_(3)-alpha_(1)|=≦0.2:}\begin{equation*} \left|\frac{2}{3} \alpha_{2}+\alpha_{3}-\alpha_{1}\right|=\leqq 0.2 \tag{40.48} \end{equation*}(40.48)|23α2+α3α1|=≦0.2
[See Nordtvedt and Will (1972)]. This limit rules out several preferred-frame theories of gravity, including that of Ni (Boxes 39.1 and 39.2).
The experimental results ( 40.21 ) , ( 40.46 ) ( 40.21 ) , ( 40.46 ) (40.21),(40.46)(40.21),(40.46)(40.21),(40.46), and (40.48), when combined, place the following very rough limits on any theory that possesses a Universal rest frame:
| α 1 | = | 7 Δ 1 + Δ 2 4 γ 4 | 0.2 , (40.49) | α 2 | = | Δ 2 + ζ 1 | 0.03 , | α 3 | = | 4 β 1 2 γ 2 ζ | 2 × 10 5 . α 1 = 7 Δ 1 + Δ 2 4 γ 4 0.2 , (40.49) α 2 = Δ 2 + ζ 1 0.03 , α 3 = 4 β 1 2 γ 2 ζ 2 × 10 5 . {:[|alpha_(1)|=|7Delta_(1)+Delta_(2)-4gamma-4| <= 0.2","],[(40.49)|alpha_(2)|=|Delta_(2)+zeta-1| <= 0.03","],[|alpha_(3)|=|4beta_(1)-2gamma-2-zeta| <= 2xx10^(-5).]:}\begin{align*} & \left|\alpha_{1}\right|=\left|7 \Delta_{1}+\Delta_{2}-4 \gamma-4\right| \leqq 0.2, \\ & \left|\alpha_{2}\right|=\left|\Delta_{2}+\zeta-1\right| \leqq 0.03, \tag{40.49}\\ & \left|\alpha_{3}\right|=\left|4 \beta_{1}-2 \gamma-2-\zeta\right| \leqq 2 \times 10^{-5} . \end{align*}|α1|=|7Δ1+Δ24γ4|0.2,(40.49)|α2|=|Δ2+ζ1|0.03,|α3|=|4β12γ2ζ|2×105.
These limits completely disprove all theories with preferred frames that have been examined to date except one devised by Will and Nordtvedt [see Ni (1972); Nordtvedt and Will (1972)].
In some theories of gravity, the result of a Cavendish experiment depends on the chemical composition and internal structure of the test bodies (exercises 40.9 and 40.10). Kruezer (1968) has performed the most accurate search for such effects to date. He finds that G C G C G_(C)G_{\mathrm{C}}GC is the same for fluorine and bromine to a precision of
(40.50) | G C ( bromine ) G C ( fluorine ) G C | 5 × 10 5 . (40.50) G C (  bromine  ) G C (  fluorine  ) G C 5 × 10 5 . {:(40.50)|(G_(C)(" bromine ")-G_(C)(" fluorine "))/(G_(C))| <= 5xx10^(-5).:}\begin{equation*} \left|\frac{G_{\mathrm{C}}(\text { bromine })-G_{\mathrm{C}}(\text { fluorine })}{G_{\mathrm{C}}}\right| \leqq 5 \times 10^{-5} . \tag{40.50} \end{equation*}(40.50)|GC( bromine )GC( fluorine )GC|5×105.

Exercise 40.9. CAVENDISH CONSTANT FOR IDEALIZED SUN

Idealize the sun as a static sphere of perfect fluid at rest at the origin of the PPN coordinates. Then its external gravitational field has the form (40.3), with M M M_(o.)M_{\odot}M given by (40.4). Consequently, a test body of mass m m mmm, located far away at radius r r rrr, is accelerated by a gravitational force
(40.51a) Force = m M / r 2 . (40.51a)  Force  = m M / r 2 {:(40.51a)" Force "=-mM_(o.)//r^(2)". ":}\begin{equation*} \text { Force }=-m M_{\odot} / r^{2} \text {. } \tag{40.51a} \end{equation*}(40.51a) Force =mM/r2
(a) Calculate the mass of the sun, M M MMM, in the sense of the amount of energy required to construct it by adding one spherical shell of matter on top of another, working from the inside outward. [Answer:
M = 0 R ρ 0 ( 1 + Π + 3 γ U ) 4 π r 2 d r rest mass + internal energy 1 2 0 R ρ 0 U 4 π r 2 d r gravitational potential energy = 0 R ρ 0 [ 1 + Π + ( 3 γ 1 2 ) U ] 4 π r 2 d r . ] M = 0 R ρ 0 ( 1 + Π + 3 γ U ) 4 π r 2 d r rest mass + internal energy  1 2 0 R ρ 0 U 4 π r 2 d r gravitational potential energy  = 0 R ρ 0 1 + Π + 3 γ 1 2 U 4 π r 2 d r . {:[M=ubrace(int_(0)^(R_(o.))rho_(0)(1+Pi+3gamma U)4pir^(2)drubrace)_("rest mass + internal energy ")ubrace(-(1)/(2)int_(0)^(R_(o.))rho_(0)U4pir^(2)drubrace)_("gravitational potential energy ")],[{:=int_(0)^(R_(o.))rho_(0)[1+Pi+(3gamma-(1)/(2))U]4pir^(2)dr.]]:}\begin{align*} M & =\underbrace{\int_{0}^{R_{\odot}} \rho_{0}(1+\Pi+3 \gamma U) 4 \pi r^{2} d r}_{\text {rest mass + internal energy }} \underbrace{-\frac{1}{2} \int_{0}^{R_{\odot}} \rho_{0} U 4 \pi r^{2} d r}_{\text {gravitational potential energy }} \\ & \left.=\int_{0}^{R_{\odot}} \rho_{0}\left[1+\Pi+\left(3 \gamma-\frac{1}{2}\right) U\right] 4 \pi r^{2} d r .\right] \end{align*}M=0Rρ0(1+Π+3γU)4πr2drrest mass + internal energy 120Rρ0U4πr2drgravitational potential energy =0Rρ0[1+Π+(3γ12)U]4πr2dr.]
(b) Use the virial theorem [equation (39.21b)] to rewrite equation (40.4) in the form
(40.51c) M = 0 R ρ 0 [ 1 + β 3 Π + ( 2 β 2 + 1 2 β 4 ) U ] 4 π r 2 d r . (40.51c) M = 0 R ρ 0 1 + β 3 Π + 2 β 2 + 1 2 β 4 U 4 π r 2 d r . {:(40.51c)M_(o.)=int_(0)^(R_(o.))rho_(0)[1+beta_(3)Pi+(2beta_(2)+(1)/(2)beta_(4))U]4pir^(2)dr.:}\begin{equation*} M_{\odot}=\int_{0}^{R_{\odot}} \rho_{0}\left[1+\beta_{3} \Pi+\left(2 \beta_{2}+\frac{1}{2} \beta_{4}\right) U\right] 4 \pi r^{2} d r . \tag{40.51c} \end{equation*}(40.51c)M=0Rρ0[1+β3Π+(2β2+12β4)U]4πr2dr.

EXERCISES

(c) Combine the above equations with the definition
(40.51d) Force = G C m M / r 2 (40.51d)  Force  = G C m M / r 2 {:(40.51d)" Force "=-G_(C)mM//r^(2):}\begin{equation*} \text { Force }=-G_{\mathrm{C}} m M / r^{2} \tag{40.51d} \end{equation*}(40.51d) Force =GCmM/r2
of the Cavendish constant for r r rrr far outside the sun, to obtain
(40.52) G C = ( mass of sun as defined by its effect in bending world line of a faraway test particle ) ( mass-energy as defined by applying law of conservation of energy to the steps in the construction of the sun = 1 + ( ρ 0 / M 0 ) [ ( β 3 1 ) Π + 1 2 ( 4 β 2 + β 4 6 γ + 1 ) U ] 4 π r 2 d r (40.52) G C = (  mass of sun as defined by its effect in   bending world line of a faraway test particle  )  mass-energy as defined by applying law of   conservation of energy to the steps in the   construction of the sun  = 1 + ρ 0 / M 0 β 3 1 Π + 1 2 4 β 2 + β 4 6 γ + 1 U 4 π r 2 d r {:[(40.52)G_(C)=(((" mass of sun as defined by its effect in ")/(" bending world line of a faraway test particle ")))/(([" mass-energy as defined by applying law of "],[" conservation of energy to the steps in the "],[" construction of the sun "])],[=1+int(rho_(0)//M_(0))[(beta_(3)-1)Pi+(1)/(2)(4beta_(2)+beta_(4)-6gamma+1)U]4pir^(2)dr]:}\begin{align*} G_{\mathrm{C}} & =\frac{\binom{\text { mass of sun as defined by its effect in }}{\text { bending world line of a faraway test particle }}}{\left(\begin{array}{l} \text { mass-energy as defined by applying law of } \\ \text { conservation of energy to the steps in the } \\ \text { construction of the sun } \end{array}\right.} \tag{40.52}\\ & =1+\int\left(\rho_{0} / M_{0}\right)\left[\left(\beta_{3}-1\right) \Pi+\frac{1}{2}\left(4 \beta_{2}+\beta_{4}-6 \gamma+1\right) U\right] 4 \pi r^{2} d r \end{align*}(40.52)GC=( mass of sun as defined by its effect in  bending world line of a faraway test particle )( mass-energy as defined by applying law of  conservation of energy to the steps in the  construction of the sun =1+(ρ0/M0)[(β31)Π+12(4β2+β46γ+1)U]4πr2dr
Unless β 3 = 1 β 3 = 1 beta_(3)=1\beta_{3}=1β3=1, and 4 β 2 + β 4 6 γ + 1 = 0 4 β 2 + β 4 6 γ + 1 = 0 4beta_(2)+beta_(4)-6gamma+1=04 \beta_{2}+\beta_{4}-6 \gamma+1=04β2+β46γ+1=0 (as they are, of course, in Einstein's theory), G C G C G_(C)G_{\mathrm{C}}GC will depend on the sun's internal structure! Specialize equation (40.52) to "conservative theories of gravity (Box 39.5), and explain why the result is what one would expect from equation (40.43).

Exercise 40.10. CAVENDISH CONSTANT FOR ANY BODY

Extend the analysis of exercise 40.9 to a source that is arbitrarily stressed and has arbitrarv shape and internal velocities (subject to the constraints v 2 1 , | t j k | / ρ 0 1 , U 1 , Π 1 v 2 1 , t j k / ρ 0 1 , U 1 , Π 1 v^(2)≪1,|t_(jk)|//rho_(0)≪1,U≪1,Pi≪1v^{2} \ll 1,\left|t_{j k}\right| / \rho_{0} \ll 1, U \ll 1, \Pi \ll 1v21,|tjk|/ρ01,U1,Π1, of the post-Newtonian approximation). Assume that the body is at rest relative to the universal rest frame. Show that G C G C G_(C)G_{\mathrm{C}}GC depends on the internal structure of the source unless
(40.53) 2 β 1 β 4 = 1 , 4 β 2 + β 4 6 γ = 1 , β 3 = 1 , ζ = 0 , η = 0 (40.53) 2 β 1 β 4 = 1 , 4 β 2 + β 4 6 γ = 1 , β 3 = 1 , ζ = 0 , η = 0 {:(40.53)2beta_(1)-beta_(4)=1","quad4beta_(2)+beta_(4)-6gamma=-1","quadbeta_(3)=1","quad zeta=0","quad eta=0:}\begin{equation*} 2 \beta_{1}-\beta_{4}=1, \quad 4 \beta_{2}+\beta_{4}-6 \gamma=-1, \quad \beta_{3}=1, \quad \zeta=0, \quad \eta=0 \tag{40.53} \end{equation*}(40.53)2β1β4=1,4β2+β46γ=1,β3=1,ζ=0,η=0
Of course, these PPN constraints are all satisfied by Einstein's theory.
The sense in which general relativity predicts geodesic motion for planets and sun

§40.9. DO PLANETS AND THE SUN MOVE ON GEODESICS?

Crucial to solar-system experiments is the question of whether the sun and the planets move on geodesics of spacetime. This question is complicated by the contributions to the spacetime curvature made by the moving body itself.
To elucidate the question-and to obtain an answer in the framework of general relativity-consider an "Einstein elevator" type of argument. The astronomical object under consideration has an outer boundary, and each point on this boundary describes a world line. These world lines define a world tube. Some distance outside of this world tube construct a "buffer zone" as in §20.6. Tailor its inner and outer dimensions, according to the mass and moments of the object and the curvature of the enveloping space ("strength of the tide-producing force of the external gravitational field"), in such a way that the departure ϵ ϵ epsilon\epsilonϵ (cf. §20.6) of the metric from flatness in this buffer zone takes on values equal at most to twice the extremal achievable value ϵ extrem ϵ extrem  epsilon_("extrem ")\epsilon_{\text {extrem }}ϵextrem  (a minimum with respect to variations in r r rrr, a maximum
with respect to variations in direction; in other words, a minimax). Then, apart from errors of order ϵ extrem ϵ extrem  epsilon_("extrem ")\epsilon_{\text {extrem }}ϵextrem , the object can be regarded as moving in an asymptotically flat space. The law of conservation of total 4-momentum applies. It assures one that the object moves in a (locally) straight line with uniform velocity. Consider, next, a "background geometry" that agrees just outside the buffer zone with the actual geometry to accuracy ϵ extrem ϵ extrem  epsilon_("extrem ")\epsilon_{\text {extrem }}ϵextrem  or better, but that inside is a source-free solution of Einstein's field equation. Then, to an accuracy governed by the magnitude of ϵ extrem ϵ extrem  epsilon_("extrem ")\epsilon_{\text {extrem }}ϵextrem , the locally straight line along which the astronomical object moves will be a geodesic of this background geometry.
Insofar as one can give any well-defined meaning to the departure of the actual motion from this geodesic (a task complicated by the fact that the background geometry does not actually exist!), one can calculate this departure by making use of the PPN formalism or some other approximation scheme [see, e.g., Taub (1965)]. This deviation springs ordinarily in substantial measure, and sometimes almost wholly, from a coupling between the Riemann curvature tensor of the external field and the multipole moments of the astronomical object (angular momentum associated with rotation; quadrupole and higher moments associated with deformation; see, e.g., exercises 40.8 and 16.4). This coupling is important for the Earth-moon system, but one need not use relativity to calculate it; Newtonian theory does the job to far greater accuracy than needed-or would, if one understood the interiors of the Earth and the moon well enough! For the planets and sun, the effect is negligible. (Exercise: use Newtonian theory to prove so!).
Thus, in general relativity as applied to the solar system, one can approximate the orbit of the sun, the Earth-moon mass center, and each other planet, as a geodesic of that "background spacetime geometry" which would exist if its own curvature effects were absent. This is the approach used to analyze the perihelion shift for planets in § 40.5 § 40.5 §40.5\S 40.5§40.5 in the context of general relativity, and to derive in exercise 39.15 the post-Newtonian "many-body equations of motion."
In most other metric theories of gravity, including the Dicke-Brans-Jordan theory, there are substantial departures from geodesic motion. The "Einstein elevator" argument fails in these theories because spacetime is endowed not only with a metric, but also with a long-range field that couples indirectly (cf. § § 38.7 § § 38.7 §§38.7\S \S 38.7§§38.7 and 39.2 ) to massive, gravitating bodies.
This phenomenon is best understood in terms of Dicke's argument about the influence of spatial variations of the fundamental constants on experiments of the Eötvös-Dicke type (see §38.6). In a theory where the Cavendish gravitational constant G C G C G_(C)G_{\mathrm{C}}GC depends on position (as it does not and cannot in general relativity), a body with significant self-gravitational energy E grav E grav  E_("grav ")E_{\text {grav }}Egrav  must fall, in a perfectly uniform external Newtonian gravitational field, with an anomalous acceleration:
(40.54) ( acceleration of massive body ) ( acceleration of test body ) = 1 M ( E grav G C ) G C = E grav M G C G C (40.54) (  acceleration of   massive body  ) (  acceleration of   test body  ) = 1 M E grav G C G C = E grav  M G C G C {:[(40.54)((" acceleration of ")/(" massive body "))-((" acceleration of ")/(" test body "))=(1)/(M)((delE_(grav))/(delG_(C)))gradG_(C)],[=(E_("grav "))/(MG_(C))gradG_(C)]:}\begin{align*} \binom{\text { acceleration of }}{\text { massive body }}-\binom{\text { acceleration of }}{\text { test body }} & =\frac{1}{M}\left(\frac{\partial E_{\mathrm{grav}}}{\partial G_{\mathrm{C}}}\right) \nabla G_{\mathrm{C}} \tag{40.54}\\ & =\frac{E_{\text {grav }}}{M G_{\mathrm{C}}} \nabla G_{\mathrm{C}} \end{align*}(40.54)( acceleration of  massive body )( acceleration of  test body )=1M(EgravGC)GC=Egrav MGCGC
Deviations from geodesic motion:
(1) due to curvature coupling
(2) due to spatial dependence of gravitational constant (Nordtvedt effect)
[see equation (38.15)]. In Dicke-Brans-Jordan theory, G C G C G_(C)G_{\mathrm{C}}GC is essentially the reciprocal of the scalar field; and it contains a small part that is proportional to the Newtonian potential, U U UUU [equation (40.43) with the appropriate values of the parameters from Box 39.2]. As a result, the sun falls with an acceleration smaller by one part in 10 6 10 6 10^(6)10^{6}106 than the acceleration of a test body; Jupiter falls with an acceleration one part in 10 9 10 9 10^(9)10^{9}109 smaller; and the Earth, one part in 10 10 10 10 10^(10)10^{10}1010 smaller. Translated into relativistic language: the scalar field, by influencing the gravitational self-energy of a massive body, produces deviations from geodesic motion.
One can use the full PPN formalism of Chapter 29 to calculate the motion of massive bodies in any metric theory of gravity. Nordtvedt (1968b) and Will (1971a) have done this. They find that a massive body at rest in a uniform external field experiences a (Newtonian-type) PPN coordinate acceleration given by
d 2 x j d t 2 = E j k U x k d 2 x j d t 2 = E j k U x k (d^(2)x_(j))/(dt^(2))=E_(jk)(del U)/(delx_(k))\frac{d^{2} x_{j}}{d t^{2}}=E_{j k} \frac{\partial U}{\partial x_{k}}d2xjdt2=EjkUxk
where E j k E j k E_(jk)E_{j k}Ejk is a quantity depending on the body's structure:
E j k = δ j k { 1 ( 7 Δ 1 3 γ 4 β ) E grav m } ( 2 β + 2 β 2 3 γ + Δ 2 2 ) Ω j k m (40.55) Ω j k = 1 2 ρ o ρ o ( x j x j ) ( x k x k ) | x x | 3 d 3 x d 3 x , E grav = Ω j j E j k = δ j k 1 7 Δ 1 3 γ 4 β E grav m 2 β + 2 β 2 3 γ + Δ 2 2 Ω j k m (40.55) Ω j k = 1 2 ρ o ρ o x j x j x k x k x x 3 d 3 x d 3 x , E grav = Ω j j {:[E_(jk)=delta_(jk){1-(7Delta_(1)-3gamma-4beta)(E_(grav))/(m)}-(2beta+2beta_(2)-3gamma+Delta_(2)-2)(Omega_(jk))/(m)],[(40.55)Omega_(jk)=-(1)/(2)int(rho_(o)rho_(o)^(')(x_(j)-x_(j)^('))(x_(k)-x_(k)^(')))/(|x-x^(')|^(3))d^(3)xd^(3)x^(')","quadE_(grav)=sumOmega_(jj)]:}\begin{gather*} E_{j k}=\delta_{j k}\left\{1-\left(7 \Delta_{1}-3 \gamma-4 \beta\right) \frac{E_{\mathrm{grav}}}{m}\right\}-\left(2 \beta+2 \beta_{2}-3 \gamma+\Delta_{2}-2\right) \frac{\Omega_{j k}}{m} \\ \Omega_{j k}=-\frac{1}{2} \int \frac{\rho_{o} \rho_{o}^{\prime}\left(x_{j}-x_{j}^{\prime}\right)\left(x_{k}-x_{k}^{\prime}\right)}{\left|\boldsymbol{x}-\boldsymbol{x}^{\prime}\right|^{3}} d^{3} x d^{3} x^{\prime}, \quad E_{\mathrm{grav}}=\sum \Omega_{j j} \tag{40.55} \end{gather*}Ejk=δjk{1(7Δ13γ4β)Egravm}(2β+2β23γ+Δ22)Ωjkm(40.55)Ωjk=12ρoρo(xjxj)(xkxk)|xx|3d3xd3x,Egrav=Ωjj
Here m m mmm is the body's total mass-energy, Ω j k Ω j k Omega_(jk)\Omega_{j k}Ωjk is the "Chandrasekhar potential-energy tensor," and E grav E grav  E_("grav ")E_{\text {grav }}Egrav  is the body's self-gravitational energy. [Note: Dicke's method of calculating the anomalous acceleration (40.54) breaks down in theories that are not "conservative" (Box 39.5).]
In general relativity, the combinations of PPN coefficients appearing in E j k E j k E_(jk)E_{j k}Ejk vanish; so E j k = δ j k E j k = δ j k E_(jk)=delta_(jk)E_{j k}=\delta_{j k}Ejk=δjk, and the body falls with the usual acceleration-i.e., it moves along a geodesic. But in most other theories of gravity E j k δ j k E j k δ j k E_(jk)!=delta_(jk)E_{j k} \neq \delta_{j k}Ejkδjk; the body does not move on a geodesic; and its acceleration may even be in a different direction than the gradient of the Newtonian potential!
This predicted departure from geodesic motion is called the "Nordtvedt effect." The possibility of such an effect was first noticed in passing by Dicke (1964c), but was discovered independently and explored in great detail by Nordtvedt (1968a,b). The Nordtvedt effect in a theory other than general relativity produces a number of phenomena in the solar system that are potentially observable. [See Nordtvedt (1971b) for an enumeration and references.] The effect most suitable for a test is a "polarization" of the Earth-moon orbit due to the fact that the moon should fall toward the sun with a greater acceleration than does the Earth. This "polarization" results in an eccentricity in the orbit that points always toward the Sun and has the amplitude
Nordtvedt effect in Earth-moon orbit
δ r = 840 [ 3 γ + 4 β 7 Δ 1 1 3 ( 2 β + 2 β 2 3 γ + Δ 2 2 ) ] cm = 67 meters in Ni's theory (Boxes 39.1 and 39.2) = 8.4 2 + ω meters in Dicke-Brans-Jordan theory (Boxes 39.1 and 39.2) = 0 in Einstein's theory. δ r = 840 3 γ + 4 β 7 Δ 1 1 3 2 β + 2 β 2 3 γ + Δ 2 2 cm = 67  meters   in Ni's theory (Boxes  39.1  and 39.2)  = 8.4 2 + ω  meters   in Dicke-Brans-Jordan theory (Boxes  39.1  and 39.2)  = 0  in Einstein's theory.  {:[delta r,=840[3gamma+4beta-7Delta_(1)-(1)/(3)(2beta+2beta_(2)-3gamma+Delta_(2)-2)]cm],[,=67" meters ",," in Ni's theory (Boxes "39.1" and 39.2) "],[,=(8.4)/(2+omega)" meters ",," in Dicke-Brans-Jordan theory (Boxes "39.1" and 39.2) "],[,=0,," in Einstein's theory. "]:}\begin{array}{rlrl} \delta r & =840\left[3 \gamma+4 \beta-7 \Delta_{1}-\frac{1}{3}\left(2 \beta+2 \beta_{2}-3 \gamma+\Delta_{2}-2\right)\right] \mathrm{cm} \\ & =67 \text { meters } & & \text { in Ni's theory (Boxes } 39.1 \text { and 39.2) } \\ & =\frac{8.4}{2+\omega} \text { meters } & & \text { in Dicke-Brans-Jordan theory (Boxes } 39.1 \text { and 39.2) } \\ & =0 & & \text { in Einstein's theory. } \end{array}δr=840[3γ+4β7Δ113(2β+2β23γ+Δ22)]cm=67 meters  in Ni's theory (Boxes 39.1 and 39.2) =8.42+ω meters  in Dicke-Brans-Jordan theory (Boxes 39.1 and 39.2) =0 in Einstein's theory. 

Box 40.4 CATALOG OF EXPERIMENTS

Type of experiment
I. Tests of foundations of general relativity
II. Post-Newtonian
("solar-system") experiments
III. Cosmological observations
IV. Gravitational-Wave experiments
Description of experiment
  1. Tests of uniqueness of free fall (Eötvös-Dicke-Braginsky experiments
  2. Tests for existence of metric (time dilation of particle decays; role of Lorentz group in particle kinematics; etc.)
  3. Searches for new, direct-coupling, long-range fields (HughesDrever experiment; ether-drift experiments)
  4. Gravitational redshift experiments
  5. Constancy, in space and time, of the nongravitational physical constants
  6. Deflection of light and radio waves by Sun
  7. Relativistic delay in round-trip travel time for radar beams passing near Sun
  8. Perihelion shifts and periodic perturbations in planetary orbits
  9. Three-body effects in the Lunar orbit
  10. Precession of gyroscopes ("geodetic precession" and precession due to dragging of inertial frames by Earth's rotation)
  11. Spatial variation of the Cavendish gravitational constant in the solar system
  12. Dependence of the Cavendish gravitational constant on the chemical composition of the gravitating body
  13. Earth tides with sidereal periods
  14. Annual variations in Earth rotation rate
  15. Periodicities in Earth-Moon separation due to breakdown of geodesic motion
  16. Change of Cavendish gravitational constant with time in solar system
  17. Large-scale features of universe (expansion, isotropy, homogeneity; existence and properties of cosmic microwave radiation; . . .)
  18. Agreement of various measures of age of universe (age from expansion; ages of oldest stars; age of solar system)
Existence of waves; propagation speed; polarization properties; ...
Where discussed
§38.3; Figure 1.6; Box 1.1
838.4
§38.7; Figure 38.3
§38.5; Figures 38.1 and 38.2 ; 887.2 38.2 ; 887.2 38.2;887.238.2 ; 887.238.2;887.2, 7.3 , and 7.4
§38.6
§40.3; Box 40.1
§40.4; Box 40.2
§40.5; Box 40.3
$ 40.6 $ 40.6 $40.6\$ 40.6$40.6
§40.7
$ 840.8 $ 840.8 $840.8\$ 840.8$840.8 and 40.9
§40.8
$40.8
§40.8
$ 40.9 $ 40.9 $40.9\$ 40.9$40.9
§40.8
Chapters 27-30; especially Chapter 29
§29.7
Chapters 35-37; especially Chapter 37

Figure 40.4. (facing page)

Measuring the separation between earth and moon by determining the time-delay (about 2.5 sec ) between the emission of light from a laser on the earth and the return of this light to the earth. A key element in the program is a corner reflector, the first of which was landed on the moon July 20, 1969, by the Apollo 11 flight crew. In November 1971, there were three such reflectors on the moon: two American, and one French-built and Soviet-landed. A pulsed ruby laser projects a beam out of the 107 -inch reflecting telescope of the McDonald Observatory of the University of Texas, on Mount Locke, 119 miles east of El Paso. This beam makes a spot of light on the moon's surface about 3.2 km in diameter. Laser light is bounced straight back to the earth by the "laser ranging retroreflectors" ( LR 3 ) LR 3 (LR^(3))\left(\mathrm{LR}^{3}\right)(LR3). Each consists of an aluminum panel of 46 cm by 46 cm with 100 fused silica corner cubes each 3.8 cm in diameter. The first reflector ever set up appears in the first inset, near the lunar landing module. It is tilted with respect to the landscape of the moon. The photograph was made shortly before astronauts Neil A. Armstrong and Edwin E. Aldrin, Jr., took off for the earth. The second inset is a photograph made by D. G. Currie of the field of view in the guiding eyepiece of the McDonald 107 -inch telescope in an interval when the laser was not firing at the Apollo 11 site. One guides the telescope to Tranquility Base (small circle) by aligning fiducial marks on more visible moonscape features. In November 1971, the LR 3 LR 3 LR^(3)\mathrm{LR}^{3}LR3 experiment and continuing time-of-flight measurements were the responsibility of the National Aeronautics and Space Administration and a Lunar Retroreflecting Ranging Team of representatives from several centers of research. One of the members of this team, Carroll Alley, of the University of Maryland, is hereby thanked for his kindness in providing the photographs used in this montage. Thanks to this NASA work, the distance between the laser source on the earth and the reflectors on the moon is known with an accuracy now better than half a meter. The astronauts left behind on the moon not only LR 3 LR 3 LR^(3)\mathrm{LR}^{3}LR3 and a seismometer and other equipment, but also a plaque: "We came in peace for all mankind."
By the mid 1970's, lunar laser-ranging data will probably be able to determine the amplitudes of this polarization to a precision of one meter or better [see Bender et al. (1971); also Figure 40.4].

§40.10. SUMMARY OF EXPERIMENTAL TESTS OF GENERAL RELATIVITY

No longer is general relativity "a theorist's Paradise, but an experimentalist's Hell." It is now a Paradise for all-as one can see quickly by perusing the catalog of experiments given in Box 40.4 on page 1129. Moreover, general relativity has emerged from each of its tests unscathed-a remarkable 1973 tribute to the 1915 genius of Albert Einstein.

FRONTIERS

Wherein the reader-who, during a life of continued variety for forty chapters (besides the Preface), was eight chapters a mathematician, four times enticed (once by an old friend), four chapters a cosmologist, and four chapters a transported astrophysicist in the land of black holes, and who at last inherited a wealth of experiments, lived honest, and became a
True Believer-now ventures forth in search of new
frontiers to conquer.

синоттия 41

SPINORS

§41.1. REFLECTIONS, ROTATIONS, AND THE COMBINATION OF ROTATIONS

Spinors and their applications in relativity grew out of the analysis of "rotations," first in space, then in spacetime. Take a cube (Figure 41.1). Rotate it about one axis through 90 90 90^(@)90^{\circ}90. Then pick another axis at right angles to the first. About it rotate the cube again through 90 90 90^(@)90^{\circ}90. In this way the cube is carried from the orientation marked "Initial" to that marked "Final." How can one make this net transformation in a single step, with a single rotation? In other words, what is the law for the combination of rotations?
Were rotations described by vectors, then one could apply the law of combination of vectors. The resultant of two vectors of the same magnitude ( 90 ) 90 (90^(@))\left(90^{\circ}\right)(90) separated by a right angle, is a single vector that (1) lies in the same plane and (2) has the magnitude 2 1 / 2 × 90 = 127.28 2 1 / 2 × 90 = 127.28 2^(1//2)xx90^(@)=127.28^(@)2^{1 / 2} \times 90^{\circ}=127.28^{\circ}21/2×90=127.28. Both predictions are wrong. To turn the cube from initial to final orientation in a single turn, (1) take an axis running from the center through the vertex A A AAA and (2) rotate through 120 120 120^(@)120^{\circ}120.
What computational algorithm can ever reproduce a law of combination of rotations apparently so strange? On the evening of October 16, 1843, William Rowan Hamilton was walking with his wife along the Royal Canal in Dublin when the answer leaped to his mind, the fruit of years of reflection. With his knife he then and there carved on a stone on Brougham Bridge the formulas*
i 2 = j 2 = k 2 = i j k = 1 , i 2 = j 2 = k 2 = i j k = 1 , i^(2)=j^(2)=k^(2)=ijk=-1,i^{2}=j^{2}=\boldsymbol{k}^{2}=i j k=-1,i2=j2=k2=ijk=1,
This chapter is entirely Track 2. No earlier Track-2 material is needed as preparation for it, nor is it needed as preparation for any later chapter.
The problem of combining rotations
Figure 41.1.
Rotation about the vertical axis through 90 90 90^(@)90^{\circ}90, followed by rotation about the horizontal axis through 90 90 90^(@)90^{\circ}90, gives a net change in orientation that can be achieved by a single rotation through 120 120 120^(@)120^{\circ}120 about an axis emergent from the center through the corner A A AAA.
which in today's notation,
(41.1) σ x = 0 1 1 0 = i i , σ y = 0 i i 0 = i j , σ z = 1 0 0 1 = i k , (41.1) σ x = 0 1 1 0 = i i , σ y = 0 i i 0 = i j , σ z = 1 0 0 1 = i k , {:(41.1)sigma_(x)=||[0,1],[1,0]||=ii","quadsigma_(y)=||[0,-i],[i,0]||=ij","quadsigma_(z)=||[1,0],[0,-1]||=ik",":}\sigma_{x}=\left\|\begin{array}{ll} 0 & 1 \tag{41.1}\\ 1 & 0 \end{array}\right\|=i \boldsymbol{i}, \quad \sigma_{y}=\left\|\begin{array}{rr} 0 & -i \\ i & 0 \end{array}\right\|=i j, \quad \sigma_{z}=\left\|\begin{array}{rr} 1 & 0 \\ 0 & -1 \end{array}\right\|=i \boldsymbol{k},(41.1)σx=0110=ii,σy=0ii0=ij,σz=1001=ik,
take the form
(41.2) σ x 2 = σ y 2 = σ z 2 = 1 σ x σ y = σ y σ x = i σ z (and cyclic permutations). (41.2) σ x 2 = σ y 2 = σ z 2 = 1 σ x σ y = σ y σ x = i σ z  (and cyclic permutations).  {:[(41.2)sigma_(x)^(2)=sigma_(y)^(2)=sigma_(z)^(2)=1],[sigma_(x)sigma_(y)=-sigma_(y)sigma_(x)=isigma_(z)" (and cyclic permutations). "]:}\begin{align*} \sigma_{x}{ }^{2} & =\sigma_{y}{ }^{2}=\sigma_{z}{ }^{2}=1 \tag{41.2}\\ \sigma_{x} \sigma_{y} & =-\sigma_{y} \sigma_{x}=i \sigma_{z} \text { (and cyclic permutations). } \end{align*}(41.2)σx2=σy2=σz2=1σxσy=σyσx=iσz (and cyclic permutations). 
To any rotation is associated a quantity (Hamilton's "quaternion;" today's "spin matrix" or "spinor transformation" or "rotation operator")
(41.3) R = cos ( θ / 2 ) i sin ( θ / 2 ) ( σ x cos α + σ y cos β + σ z cos γ ) , (41.3) R = cos ( θ / 2 ) i sin ( θ / 2 ) σ x cos α + σ y cos β + σ z cos γ , {:(41.3)R=cos(theta//2)-i sin(theta//2)(sigma_(x)cos alpha+sigma_(y)cos beta+sigma_(z)cos gamma)",":}\begin{equation*} R=\cos (\theta / 2)-i \sin (\theta / 2)\left(\sigma_{x} \cos \alpha+\sigma_{y} \cos \beta+\sigma_{z} \cos \gamma\right), \tag{41.3} \end{equation*}(41.3)R=cos(θ/2)isin(θ/2)(σxcosα+σycosβ+σzcosγ),
where θ θ theta\thetaθ is the angle of rotation and α , β , γ α , β , γ alpha,beta,gamma\alpha, \beta, \gammaα,β,γ are the angles between the axis of rotation and the coordinate axes. A rotation described by R 1 R 1 R_(1)R_{1}R1 followed by a rotation described by R 2 R 2 R_(2)R_{2}R2 gives a net change in orientation described by the single rotation
(41.4) R 3 = R 2 R 1 . (41.4) R 3 = R 2 R 1 . {:(41.4)R_(3)=R_(2)R_(1).:}\begin{equation*} R_{3}=R_{2} R_{1} . \tag{41.4} \end{equation*}(41.4)R3=R2R1.
This is Hamilton's formula for the combination of two rotations (steps toward it by Euler in 1776; obtained by Gauss in 1819 but never published by him).
In the example in Figure 41.1,
R 1 ( rotation by θ = 90 about z -axis ) = ( 1 i σ z ) / 2 1 / 2 , R 2 (rotation by θ = 90 about x -axis ) = ( 1 i σ x ) / 2 1 / 2 , R 1  rotation by  θ = 90  about  z -axis  = 1 i σ z / 2 1 / 2 , R 2  (rotation by  θ = 90  about  x -axis  = 1 i σ x / 2 1 / 2 , {:[R_(1)(" rotation by "theta=90^(@)" about "z"-axis ")=(1-isigma_(z))//2^(1//2)","],[{:R_(2)" (rotation by "theta=90^(@)" about "x"-axis ")=(1-isigma_(x))//2^(1//2)","]:}\begin{aligned} & R_{1}\left(\text { rotation by } \theta=90^{\circ} \text { about } z \text {-axis }\right)=\left(1-i \sigma_{z}\right) / 2^{1 / 2}, \\ & \left.R_{2} \text { (rotation by } \theta=90^{\circ} \text { about } x \text {-axis }\right)=\left(1-i \sigma_{x}\right) / 2^{1 / 2}, \end{aligned}R1( rotation by θ=90 about z-axis )=(1iσz)/21/2,R2 (rotation by θ=90 about x-axis )=(1iσx)/21/2,
and the product of the two is
R 2 R 1 = ( 1 i σ x + i σ y i σ z ) / 2 = cos 60 i sin 60 ( σ x / 3 1 / 2 σ y / 3 1 / 2 + σ z / 3 1 / 2 ) . R 2 R 1 = 1 i σ x + i σ y i σ z / 2 = cos 60 i sin 60 σ x / 3 1 / 2 σ y / 3 1 / 2 + σ z / 3 1 / 2 . {:[R_(2)R_(1)=(1-isigma_(x)+isigma_(y)-isigma_(z))//2],[=cos 60^(@)-i sin 60^(@)(sigma_(x)//3^(1//2)-sigma_(y)//3^(1//2)+sigma_(z)//3^(1//2)).]:}\begin{aligned} R_{2} R_{1} & =\left(1-i \sigma_{x}+i \sigma_{y}-i \sigma_{z}\right) / 2 \\ & =\cos 60^{\circ}-i \sin 60^{\circ}\left(\sigma_{x} / 3^{1 / 2}-\sigma_{y} / 3^{1 / 2}+\sigma_{z} / 3^{1 / 2}\right) . \end{aligned}R2R1=(1iσx+iσyiσz)/2=cos60isin60(σx/31/2σy/31/2+σz/31/2).
Figure 41.2.
Reflection in the plane M P Q M P Q MPQM P QMPQ carries A A AAA to B B BBB. Reflection in the plane N P Q N P Q NPQN P QNPQ carries B B BBB to C C CCC. The combination of the two reflections in the two planes separated by the angle θ / 2 θ / 2 theta//2\theta / 2θ/2 produces the same end result (transformation from A A AAA to C C CCC ) as rotation through the angle θ θ theta\thetaθ about the line P Q P Q PQP QPQ.
According to Hamilton's rule (41.3), this result implies a net rotation through 120 120 120^(@)120^{\circ}120 about a line that makes equal angles with the x x xxx-axis, the y y yyy-axis, and the z z zzz-axis, in conformity with what one already saw in Figure 41.1 (axis of rotation running from center of cube through the corner A A AAA ).
What one has just done in the special example one can do in the general case: obtain the parameters θ 3 , α 3 , β 3 , γ 3 θ 3 , α 3 , β 3 , γ 3 theta_(3),alpha_(3),beta_(3),gamma_(3)\theta_{3}, \alpha_{3}, \beta_{3}, \gamma_{3}θ3,α3,β3,γ3 of the net rotation (four unknowns!) by identifying the four coefficients of the four Hamilton units 1 , i σ x , i σ y , i σ z 1 , i σ x , i σ y , i σ z 1,-isigma_(x),-isigma_(y),-isigma_(z)1,-i \sigma_{x},-i \sigma_{y},-i \sigma_{z}1,iσx,iσy,iσz on both sides of the equation R 3 = R 2 R 1 R 3 = R 2 R 1 R_(3)=R_(2)R_(1)R_{3}=R_{2} R_{1}R3=R2R1. In this way one arrives at the four prequaternion formulas of Olinde Rodrigues (1840) for the combination of the two rotations.
Why do half-angles put in an appearance? And what is behind the law of combination of rotations? The answer to both questions is the same: a rotation through the angle θ θ theta\thetaθ about a given axis may be visualized as the consequence of successive reflections in two planes that meet along that axis at the angle θ / 2 θ / 2 theta//2\theta / 2θ/2 (Figure 41.2). Two rotations therefore imply four reflections. However, it can be arranged that reflections no. 2 and no. 3 take place in the same plane, the plane that includes the two axes of rotation. Then reflection no. 3 exactly undoes reflection no. 2. By now there remain only reflections no. 1 and no. 4 , which together constitute one rotation: the net rotation that was desired (Figures 41.3 and 41.4).
The rotation
(41.3) R = cos ( θ / 2 ) i sin ( θ / 2 ) ( σ x cos α + σ y cos β + σ z cos γ ) (41.3) R = cos ( θ / 2 ) i sin ( θ / 2 ) σ x cos α + σ y cos β + σ z cos γ {:(41.3)R=cos(theta//2)-i sin(theta//2)(sigma_(x)cos alpha+sigma_(y)cos beta+sigma_(z)cos gamma):}\begin{equation*} R=\cos (\theta / 2)-i \sin (\theta / 2)\left(\sigma_{x} \cos \alpha+\sigma_{y} \cos \beta+\sigma_{z} \cos \gamma\right) \tag{41.3} \end{equation*}(41.3)R=cos(θ/2)isin(θ/2)(σxcosα+σycosβ+σzcosγ)
is undone by the inverse rotation
( ) R 1 = cos ( θ / 2 ) + i sin ( θ / 2 ) ( σ x cos α + σ y cos β + σ z cos γ ) ( ) R 1 = cos ( θ / 2 ) + i sin ( θ / 2 ) σ x cos α + σ y cos β + σ z cos γ {:('")"R^(-1)=cos(theta//2)+i sin(theta//2)(sigma_(x)cos alpha+sigma_(y)cos beta+sigma_(z)cos gamma):}\begin{equation*} R^{-1}=\cos (\theta / 2)+i \sin (\theta / 2)\left(\sigma_{x} \cos \alpha+\sigma_{y} \cos \beta+\sigma_{z} \cos \gamma\right) \tag{$\prime$} \end{equation*}()R1=cos(θ/2)+isin(θ/2)(σxcosα+σycosβ+σzcosγ)
Thus the product of the two rotation operators
(41.5) R R 1 = R 1 R = 1 (41.5) R R 1 = R 1 R = 1 {:(41.5)RR^(-1)=R^(-1)R=1:}\begin{equation*} R R^{-1}=R^{-1} R=1 \tag{41.5} \end{equation*}(41.5)RR1=R1R=1
is an operator, the unit operator, that leaves unchanged everything that it acts on. The reciprocal R 1 R 1 R^(-1)R^{-1}R1 of the combination R = R 2 R 1 R = R 2 R 1 R=R_(2)R_(1)R=R_{2} R_{1}R=R2R1 of two rotations is
( ) R 1 = R 1 1 R 2 1 ( ) R 1 = R 1 1 R 2 1 {:('")"R^(-1)=R_(1)^(-1)R_(2)^(-1):}\begin{equation*} R^{-1}=R_{1}^{-1} R_{2}^{-1} \tag{$\prime$} \end{equation*}()R1=R11R21
(reverse order of factors!), as one verifies by substitution into (41.5).
Geometric reason that half angles appear in rotation operators
Algebraic properties of rotation operators
Figure 41.3.
Composition of two rotations seen in terms of reflections. The first rotation (for instance, 90 90 90^(@)90^{\circ}90 about O Z O Z OZO ZOZ in the example of Figure 41.1.) is represented in terms of reflection 1 followed by reflection 2 (the planes of the two reflections being separated by 90 / 2 = 45 90 / 2 = 45 90^(@)//2=45^(@)90^{\circ} / 2=45^{\circ}90/2=45 in the example). The second reflection appears as the resultant of reflections 3 and 4. But the reflections 2 and 3 take place in the common plane ZOX. Therefore one reflection undoes the other. Thus the sequence of four operations 1234 collapses to the two reflections 1 and 4 . Their place in turn is taken by a single rotation about the axis O A O A OAO AOA.
The conjugate transpose, M M M^(**)M^{*}M, of a matrix M M MMM is obtained by taking the conjugate complex of every element in the matrix and then interchanging rows and columns. By direct inspection of matrix expressions (41.1) one sees that σ x = σ x , σ y = σ y σ x = σ x , σ y = σ y sigma_(x)^(**)=sigma_(x),sigma_(y)^(**)=sigma_(y)\sigma_{x}{ }^{*}=\sigma_{x}, \sigma_{y}{ }^{*}=\sigma_{y}σx=σx,σy=σy, σ z = σ z σ z = σ z sigma_(z)^(**)=sigma_(z)\sigma_{z}{ }^{*}=\sigma_{z}σz=σz. Such matrices are said to be Hermitian. The conjugate transpose of the product M = P Q M = P Q M=PQM=P QM=PQ of two matrices is the product M = Q P M = Q P M^(**)=Q^(**)P^(**)M^{*}=Q^{*} P^{*}M=QP of the individual conjugate transposed matrices taken in the reverse order. For the rotation matrix written down above, note that R = R 1 R = R 1 R^(**)=R^(-1)R^{*}=R^{-1}R=R1. Such a matrix is said to be unitary. The
Figure 41.4.
Law of composition of rotations epitomized by a spherical triangle in which each of the three important angles represents half an angle of rotation.
determinant of a unitary matrix may be seen to have absolute value unity from the following line of argument:
1 = det ( unit matrix ) = det ( R R 1 ) = det ( R R ) = det R det R (41.6) = | det R | 2 . 1 = det (  unit matrix  ) = det R R 1 = det R R = det R det R (41.6) = | det R | 2 . {:[1=det(" unit matrix ")=det(RR^(-1))],[=det(RR^(**))=det R detR^(**)],[(41.6)=|det R|^(2).]:}\begin{align*} 1 & =\operatorname{det}(\text { unit matrix })=\operatorname{det}\left(R R^{-1}\right) \\ & =\operatorname{det}\left(R R^{*}\right)=\operatorname{det} R \operatorname{det} R^{*} \\ & =|\operatorname{det} R|^{2} . \tag{41.6} \end{align*}1=det( unit matrix )=det(RR1)=det(RR)=detRdetR(41.6)=|detR|2.
In actuality the determinant of the rotation spin matrix is necessarily unity ("unimodular matrix") as shown in the following exercises

Exercise 41.1. ELEMENTARY FEATURES OF THE ROTATION MATRIX

EXERCISES

Write equation (41.3) in the form
R ( θ ) = cos ( θ / 2 ) i sin ( θ / 2 ) ( σ n ) , R ( θ ) = cos ( θ / 2 ) i sin ( θ / 2 ) ( σ n ) , R(theta)=cos(theta//2)-i sin(theta//2)(sigma*n),R(\theta)=\cos (\theta / 2)-i \sin (\theta / 2)(\boldsymbol{\sigma} \cdot \boldsymbol{n}),R(θ)=cos(θ/2)isin(θ/2)(σn),
and establish the following properties:
(a)
( σ n ) 2 = 1 unit matrix; ( σ n ) 2 = 1  unit matrix;  (sigma*n)^(2)=1-=" unit matrix; "(\boldsymbol{\sigma} \cdot \boldsymbol{n})^{2}=1 \equiv \text { unit matrix; }(σn)2=1 unit matrix; 
(b) tr ( σ n ) = 0 tr ( σ n ) = 0 quad tr(sigma*n)=0\quad \operatorname{tr}(\boldsymbol{\sigma} \cdot \boldsymbol{n})=0tr(σn)=0 (tr means "trace," i.e., sum of diagonal elements);
(c)
(d)
[ R , ( σ n ) ] [ commutator ] R ( σ n ) ( σ ) R = 0 ; (41.7) d R d θ = i 2 ( σ n ) R . [ R , ( σ n ) ] [ commutator  ] R ( σ n ) ( σ ) R = 0 ; (41.7) d R d θ = i 2 ( σ n ) R . {:[ubrace([R,(sigma*n)]ubrace)_(["commutator "])-=R(sigma*n)-(sigma)R=0;],[(41.7)(dR)/(d theta)=-(i)/(2)(sigma*n)R.]:}\begin{gather*} \underbrace{[R,(\boldsymbol{\sigma} \cdot \boldsymbol{n})]}_{[\text {commutator }]} \equiv R(\boldsymbol{\sigma} \cdot \boldsymbol{n})-(\boldsymbol{\sigma}) R=0 ; \\ \frac{d R}{d \theta}=-\frac{i}{2}(\boldsymbol{\sigma} \cdot \boldsymbol{n}) R . \tag{41.7} \end{gather*}[R,(σn)][commutator ]R(σn)(σ)R=0;(41.7)dRdθ=i2(σn)R.
[Note that if one thinks of θ θ theta\thetaθ as increasing with angular velocity ω ω omega\omegaω, so d θ / d t = ω = d θ / d t = ω = d theta//dt=omega=d \theta / d t=\omega=dθ/dt=ω= constant, then this last equation reads
(41.7') d R d t = i 2 ( σ ω ) R (41.7') d R d t = i 2 ( σ ω ) R {:(41.7')(dR)/(dt)=-(i)/(2)(sigma*omega)R:}\begin{equation*} \frac{d R}{d t}=-\frac{i}{2}(\boldsymbol{\sigma} \cdot \omega) R \tag{41.7'} \end{equation*}(41.7')dRdt=i2(σω)R
where ω = ω n ω = ω n omega=omega n\omega=\omega nω=ωn.]

Exercise 41.2. ROTATION MATRIX HAS UNIT DETERMINANT

Recall from exercise 5.5 that for any matrix M M MMM one has
d [ ln ( det M ) ] = tr ( M 1 d M ) d [ ln ( det M ) ] = tr M 1 d M d[ln(det M)]=tr(M^(-1)dM)d[\ln (\operatorname{det} M)]=\operatorname{tr}\left(M^{-1} d M\right)d[ln(detM)]=tr(M1dM)
and use this to show that det R det R det R\operatorname{det} RdetR in (41.7) is constant, and therefore equal to ( det R ) θ = 0 = 1 ( det R ) θ = 0 = 1 (det R)_(theta=0)=1(\operatorname{det} R)_{\theta=0}=1(detR)θ=0=1.
Infinitesimal rotations
Representation of a 3-vector as a spin matrix

§41.2. INFINITESIMAL ROTATIONS

A given rotation can be obtained by performing in turn two rotations of half the magnitude, or four rotations of a fourth the magnitude, or eight of an eighth the magnitude, and so on. Thus one arrives in the limit at the concept of an infinitesimal rotation described by the spin matrix
R = 1 ( i / 2 ) ( σ x d θ y z + σ y d θ z x + σ z d θ x y ) R = 1 ( i / 2 ) σ x d θ y z + σ y d θ z x + σ z d θ x y R=1-(i//2)(sigma_(x)dtheta_(yz)+sigma_(y)dtheta_(zx)+sigma_(z)dtheta_(xy))R=1-(i / 2)\left(\sigma_{x} d \theta_{y z}+\sigma_{y} d \theta_{z x}+\sigma_{z} d \theta_{x y}\right)R=1(i/2)(σxdθyz+σydθzx+σzdθxy)
or
(41.8) R = 1 ( i d θ / 2 ) ( σ n ) (41.8) R = 1 ( i d θ / 2 ) ( σ n ) {:(41.8)R=1-(id theta//2)(sigma*n):}\begin{equation*} R=1-(i d \theta / 2)(\boldsymbol{\sigma} \cdot \boldsymbol{n}) \tag{41.8} \end{equation*}(41.8)R=1(idθ/2)(σn)
Here the quantities
d θ y z = d θ z y = n x d θ = cos α d θ , (41.9) d θ z x = d θ x z = n y d θ = cos β d θ , d θ x y = d θ y x = n z d θ = cos γ d θ , d θ y z = d θ z y = n x d θ = cos α d θ , (41.9) d θ z x = d θ x z = n y d θ = cos β d θ , d θ x y = d θ y x = n z d θ = cos γ d θ , {:[dtheta_(yz)=-dtheta_(zy)=n^(x)d theta=cos alpha d theta","],[(41.9)dtheta_(zx)=-dtheta_(xz)=n^(y)d theta=cos beta d theta","],[dtheta_(xy)=-dtheta_(yx)=n^(z)d theta=cos gamma d theta","]:}\begin{align*} d \theta_{y z} & =-d \theta_{z y}=n^{x} d \theta=\cos \alpha d \theta, \\ d \theta_{z x} & =-d \theta_{x z}=n^{y} d \theta=\cos \beta d \theta, \tag{41.9}\\ d \theta_{x y} & =-d \theta_{y x}=n^{z} d \theta=\cos \gamma d \theta, \end{align*}dθyz=dθzy=nxdθ=cosαdθ,(41.9)dθzx=dθxz=nydθ=cosβdθ,dθxy=dθyx=nzdθ=cosγdθ,
are the components of the infinitesimal rotation in the three indicated planes. An infinitesimal rotation in the ( x , y ) ( x , y ) (x,y)(x, y)(x,y)-plane through the angle d θ x y d θ x y dtheta_(xy)d \theta_{x y}dθxy transforms the vector x = ( x , y , z ) x = ( x , y , z ) x=(x,y,z)\boldsymbol{x}=(x, y, z)x=(x,y,z) into a new vector with changed components x x x^(')x^{\prime}x and y y y^(')y^{\prime}y but with unchanged component z = z z = z z^(')=zz^{\prime}=zz=z. More generally, the infinitesimal rotation (41.8) considered in this same "active" sense* produces the transformation
x x x x x longrightarrowx^(')x \longrightarrow x^{\prime}xx
with
x = x ( d θ x y ) y ( d θ x z ) z (41.10) y = ( d θ y x ) x + y ( d θ y z ) z z = ( d θ z x ) x ( d θ z y ) y + z x = x d θ x y y d θ x z z (41.10) y = d θ y x x + y d θ y z z z = d θ z x x d θ z y y + z {:[x^(')=x-(dtheta_(xy))y-(dtheta_(xz))z],[(41.10)y^(')=-(dtheta_(yx))x+y-(dtheta_(yz))z],[z^(')=-(dtheta_(zx))x-(dtheta_(zy))y+z]:}\begin{gather*} x^{\prime}=x-\left(d \theta_{x y}\right) y-\left(d \theta_{x z}\right) z \\ y^{\prime}=-\left(d \theta_{y x}\right) x+y-\left(d \theta_{y z}\right) z \tag{41.10}\\ z^{\prime}=-\left(d \theta_{z x}\right) x-\left(d \theta_{z y}\right) y+z \end{gather*}x=x(dθxy)y(dθxz)z(41.10)y=(dθyx)x+y(dθyz)zz=(dθzx)x(dθzy)y+z
Spinor calculus provides an alternative (and shorthand!) means to calculate the foregoing effect of a rotation on a vector. Associate with the vector x x x\boldsymbol{x}x the spin matrix
(41.11) X = x σ x + y σ y + z σ z = ( x σ ) (41.11) X = x σ x + y σ y + z σ z = ( x σ ) {:(41.11)X=xsigma_(x)+ysigma_(y)+zsigma_(z)=(x*sigma):}\begin{equation*} X=x \sigma_{x}+y \sigma_{y}+z \sigma_{z}=(x \cdot \boldsymbol{\sigma}) \tag{41.11} \end{equation*}(41.11)X=xσx+yσy+zσz=(xσ)
and with the vector x x x^(')\boldsymbol{x}^{\prime}x a corresponding spin matrix or quaternion X X X^(')X^{\prime}X. Then the effect of the rotation is summarized in the formula
$$
(41.12) X X = R X R (41.12) X X = R X R {:(41.12)X longrightarrowX^(')=RXR^(**):}\begin{equation*} X \longrightarrow X^{\prime}=R X R^{*} \tag{41.12} \end{equation*}(41.12)XX=RXR
$$
Test this formula for the general infinitesimal rotation (41.10). It reads
( x σ ) = [ 1 ( i d θ / 2 ) ( σ n ) ] ( x σ ) [ 1 + ( i d θ / 2 ) ( σ n ) ] x σ = [ 1 ( i d θ / 2 ) ( σ n ) ] ( x σ ) [ 1 + ( i d θ / 2 ) ( σ n ) ] (x^(')*sigma)=[1-(id theta//2)(sigma*n)](x*sigma)[1+(id theta//2)(sigma*n)]\left(\boldsymbol{x}^{\prime} \cdot \boldsymbol{\sigma}\right)=[1-(i d \theta / 2)(\boldsymbol{\sigma} \cdot \boldsymbol{n})](\boldsymbol{x} \cdot \boldsymbol{\sigma})[1+(i d \theta / 2)(\boldsymbol{\sigma} \cdot \boldsymbol{n})](xσ)=[1(idθ/2)(σn)](xσ)[1+(idθ/2)(σn)]
or, to the first order in the quantity d θ d θ d thetad \thetadθ,
(41.13) ( x σ ) = ( x σ ) + ( i d θ / 2 ) [ ( x σ ) ( σ n ) ( σ n ) ( x σ ) ] . (41.13) x σ = ( x σ ) + ( i d θ / 2 ) [ ( x σ ) ( σ n ) ( σ n ) ( x σ ) ] . {:(41.13)(x^(')*sigma)=(x*sigma)+(id theta//2)[(x*sigma)(sigma*n)-(sigma*n)(x*sigma)].:}\begin{equation*} \left(\boldsymbol{x}^{\prime} \cdot \sigma\right)=(\boldsymbol{x} \cdot \sigma)+(i d \theta / 2)[(\boldsymbol{x} \cdot \boldsymbol{\sigma})(\boldsymbol{\sigma} \cdot \boldsymbol{n})-(\boldsymbol{\sigma} \cdot \boldsymbol{n})(\boldsymbol{x} \cdot \boldsymbol{\sigma})] . \tag{41.13} \end{equation*}(41.13)(xσ)=(xσ)+(idθ/2)[(xσ)(σn)(σn)(xσ)].
The product of spin matrices A = ( a σ ) A = ( a σ ) A=(a*sigma)A=(\boldsymbol{a} \cdot \boldsymbol{\sigma})A=(aσ) and B = ( b σ ) B = ( b σ ) B=(b*sigma)B=(\boldsymbol{b} \cdot \boldsymbol{\sigma})B=(bσ) built from two distinct vectors a a a\boldsymbol{a}a and b b b\boldsymbol{b}b is
A B = ( a σ ) ( b σ ) = a x b x σ x 2 + a x b y σ x σ y + , A B = ( a σ ) ( b σ ) = a x b x σ x 2 + a x b y σ x σ y + , AB=(a*sigma)(b*sigma)=a^(x)b^(x)sigma_(x)^(2)+a^(x)b^(y)sigma_(x)sigma_(y)+dots,A B=(\boldsymbol{a} \cdot \boldsymbol{\sigma})(\boldsymbol{b} \cdot \boldsymbol{\sigma})=a^{x} b^{x} \sigma_{x}^{2}+a^{x} b^{y} \sigma_{x} \sigma_{y}+\ldots,AB=(aσ)(bσ)=axbxσx2+axbyσxσy+,
or, according to (41.2),
(41.14) A B = ( a b ) + i ( a × b ) σ (41.14) A B = ( a b ) + i ( a × b ) σ {:(41.14)AB=(a*b)+i(a xx b)*sigma:}\begin{equation*} A B=(\boldsymbol{a} \cdot \boldsymbol{b})+i(\boldsymbol{a} \times \boldsymbol{b}) \cdot \boldsymbol{\sigma} \tag{41.14} \end{equation*}(41.14)AB=(ab)+i(a×b)σ
Employ this formula to evaluate the right-hand side of (41.13). In the square brackets, the terms in ( x n ) ( x n ) (x*n)(\boldsymbol{x} \cdot \boldsymbol{n})(xn) have opposite signs and cancel. In contrast, the terms in ( n × x ) ( n × x ) (n xx x)(\boldsymbol{n} \times \boldsymbol{x})(n×x) have the same sign. They combine to cancel the factor 2 in ( d θ / 2 ) ( d θ / 2 ) (d theta//2)(d \theta / 2)(dθ/2). End up with
( x σ ) = ( x σ ) + d θ ( n × x ) σ x σ = ( x σ ) + d θ ( n × x ) σ (x^(')*sigma)=(x*sigma)+d theta(n xx x)*sigma\left(\boldsymbol{x}^{\prime} \cdot \boldsymbol{\sigma}\right)=(\boldsymbol{x} \cdot \boldsymbol{\sigma})+d \theta(\boldsymbol{n} \times \boldsymbol{x}) \cdot \boldsymbol{\sigma}(xσ)=(xσ)+dθ(n×x)σ
or
(41.15) x = [ 1 + ( d θ ) n × ] x (41.15) x = [ 1 + ( d θ ) n × ] x {:(41.15)x^(')=[1+(d theta)n xx]x:}\begin{equation*} \boldsymbol{x}^{\prime}=[1+(d \theta) \boldsymbol{n} \times] \boldsymbol{x} \tag{41.15} \end{equation*}(41.15)x=[1+(dθ)n×]x
in agreement with (41.10), as was to be shown.
A finite rotation about a given axis can be considered as the composition of infinitesimal rotations about that axis. To see this composition in simplest form, rewrite the spin matrix (41.8) associated with the general infinitesimal rotation as
(41.16) R ( d θ ) = e ( i d θ / 2 ) ( σ n ) (41.16) R ( d θ ) = e ( i d θ / 2 ) ( σ n ) {:(41.16)R(d theta)=e^(-(id theta//2)(sigma*n)):}\begin{equation*} R(d \theta)=e^{-(i d \theta / 2)(\sigma \cdot n)} \tag{41.16} \end{equation*}(41.16)R(dθ)=e(idθ/2)(σn)
(exponential function defined by its power-series expansion). Note that ( σ n σ n sigma*n\boldsymbol{\sigma} \cdot \boldsymbol{n}σn ) commutes (a) with unity and (b) with itself, and in addition (c) has a unit square. Therefore the calculation of the exponential function proceeds no differently here, for spin matrices, than for everyday algebra. The composition of the spin matrices for infinitesimal rotations about an unchanging axis proceeds by adding exponents, to give
(41.17) R ( θ ) = e i ( θ / 2 ) ( σ n ) (41.17) R ( θ ) = e i ( θ / 2 ) ( σ n ) {:(41.17)R(theta)=e^(-i(theta//2)(sigma*n)):}\begin{equation*} R(\theta)=e^{-i(\theta / 2)(\sigma \cdot n)} \tag{41.17} \end{equation*}(41.17)R(θ)=ei(θ/2)(σn)
which can also be obtained immediately from equation (41.7). This expression can be put in another form by developing the power series; thus,
Composition of finite rotation from infinitesimal rotations
Rotation of a 3-vector described in spin-matrix language
R ( θ ) = p = 0 ( 1 / p ! ) ( i θ σ n / 2 ) p (41.18) = even p ( 1 / p ! ) ( i θ / 2 ) p + ( σ n ) odd p ( 1 / p ! ) ( i θ / 2 ) p = cos ( θ / 2 ) i sin ( θ / 2 ) ( σ n ) R ( θ ) = p = 0 ( 1 / p ! ) ( i θ σ n / 2 ) p (41.18) = even  p ( 1 / p ! ) ( i θ / 2 ) p + ( σ n ) odd  p ( 1 / p ! ) ( i θ / 2 ) p = cos ( θ / 2 ) i sin ( θ / 2 ) ( σ n ) {:[R(theta)=sum_(p=0)^(oo)(1//p!)(-i theta sigma*n//2)^(p)],[(41.18)=sum_("even "p)(1//p!)(-i theta//2)^(p)+(sigma*n)sum_("odd "p)(1//p!)(-i theta//2)^(p)],[=cos(theta//2)-i sin(theta//2)(sigma*n)]:}\begin{align*} R(\theta) & =\sum_{p=0}^{\infty}(1 / p!)(-i \theta \boldsymbol{\sigma} \cdot \boldsymbol{n} / 2)^{p} \\ & =\sum_{\text {even } p}(1 / p!)(-i \theta / 2)^{p}+(\boldsymbol{\sigma} \cdot \boldsymbol{n}) \sum_{\text {odd } p}(1 / p!)(-i \theta / 2)^{p} \tag{41.18}\\ & =\cos (\theta / 2)-i \sin (\theta / 2)(\boldsymbol{\sigma} \cdot \boldsymbol{n}) \end{align*}R(θ)=p=0(1/p!)(iθσn/2)p(41.18)=even p(1/p!)(iθ/2)p+(σn)odd p(1/p!)(iθ/2)p=cos(θ/2)isin(θ/2)(σn)
in agreement with the expression (41.3) originally given for a spinor transformation. The effect of one infinitesimal rotation after another after another . . . on a vector is given by
X = R ( d θ ) R ( d θ ) X R ( d θ ) R ( d θ ) X = R ( d θ ) R ( d θ ) X R ( d θ ) R ( d θ ) X^(')=R(d theta)dots R(d theta)XR^(**)(d theta)dotsR^(**)(d theta)X^{\prime}=R(d \theta) \ldots R(d \theta) X R^{*}(d \theta) \ldots R^{*}(d \theta)X=R(dθ)R(dθ)XR(dθ)R(dθ)
with the consequence that even for a finite rotation R = R ( θ ) R = R ( θ ) R=R(theta)R=R(\theta)R=R(θ) one is correct in employing the formula
(41.19) X = R X R (41.19) X = R X R {:(41.19)X^(')=RXR^(**):}\begin{equation*} X^{\prime}=R X R^{*} \tag{41.19} \end{equation*}(41.19)X=RXR

EXERCISE

Exercise 41.3. MORE PROPERTIES OF THE ROTATION MATRIX

Show that for X = x σ X = x σ X=x*sigmaX=\boldsymbol{x} \cdot \boldsymbol{\sigma}X=xσ one has the commutation relation
[ ( σ n ) , X ] = 2 i ( n × x ) σ [ ( σ n ) , X ] = 2 i ( n × x ) σ [(sigma*n),X]=2i(n xx x)*sigma[(\boldsymbol{\sigma} \cdot \boldsymbol{n}), X]=2 i(\boldsymbol{n} \times \boldsymbol{x}) \cdot \boldsymbol{\sigma}[(σn),X]=2i(n×x)σ
Use this to obtain, from equation (41.19) in the form X = R X 0 R X = R X 0 R X=RX_(0)R^(**)X=R X_{0} R^{*}X=RX0R [where X 0 X 0 X_(0)X_{0}X0 is constant, while R ( θ ) R ( θ ) R(theta)R(\theta)R(θ) is given by equation (41.17)], the formula
d d θ ( x σ ) = ( n × x ) σ . d d θ ( x σ ) = ( n × x ) σ . (d)/(d theta)(x*sigma)=(n xx x)*sigma.\frac{d}{d \theta}(x \cdot \sigma)=(n \times x) \cdot \sigma .ddθ(xσ)=(n×x)σ.
Why is this equivalent to the standard definition
d x d t = ω × x d x d t = ω × x (dx)/(dt)=omega xx x\frac{d x}{d t}=\omega \times xdxdt=ω×x
for the angular velocity? Reverse the argument to show that equation (41.7 ) correctly defines the rotation R ( t ) R ( t ) R(t)R(t)R(t) resulting from a time-dependent angular velocity ω ( t ) ω ( t ) omega(t)\omega(t)ω(t), even though the simple solution R = exp [ 1 2 i t ( σ ω ) ] R = exp 1 2 i t ( σ ω ) R=exp[-(1)/(2)it(sigma*omega)]R=\exp \left[-\frac{1}{2} i t(\boldsymbol{\sigma} \cdot \omega)\right]R=exp[12it(σω)] of this equation can no longer be written when ω ω omega\omegaω is not constant.
4-vectors and Lorentz transformations in spin-matrix language

§41.3. LORENTZ TRANSFORMATION VIA SPINOR ALGEBRA

Generate a rotation by two reflections in space? Then why not generate a Lorentz transformation by two reflections in spacetime? If for this purpose one has to turn from a real half-angle between the two planes of reflection to a complex half-angle, that development will come as no surprise; nor will it be a surprise that one can
still represent the effect of the Lorentz transformation by a matrix multiplication of the form
(41.20) X X = L X L (41.20) X X = L X L {:(41.20)X longrightarrowX^(')=LXL^(**):}\begin{equation*} X \longrightarrow X^{\prime}=L X L^{*} \tag{41.20} \end{equation*}(41.20)XX=LXL
Here the "Lorentz spin transformation matrix" L L LLL is a generalization of the rotation matrix, R R RRR. Also the "coordinate-generating spin matrix" X X XXX is now generalized from (41.11) to
(41.21) X = t + ( x σ ) (41.21) X = t + ( x σ ) {:(41.21)X=t+(x*sigma):}\begin{equation*} X=t+(x \cdot \boldsymbol{\sigma}) \tag{41.21} \end{equation*}(41.21)X=t+(xσ)
or
(41.22) X = t + z x i y x + i y t z (41.22) X = t + z x i y x + i y t z {:(41.22)X=||[t+z,x-iy],[x+iy,t-z]||:}X=\left\|\begin{array}{cc} t+z & x-i y \tag{41.22}\\ x+i y & t-z \end{array}\right\|(41.22)X=t+zxiyx+iytz
It is demanded that this matrix be Hermitian
(41.23) X = X (41.23) X = X {:(41.23)X=X^(**):}\begin{equation*} X=X^{*} \tag{41.23} \end{equation*}(41.23)X=X
Then and only then are the coordinates ( t , x , y , z ) ( t , x , y , z ) (t,x,y,z)(t, x, y, z)(t,x,y,z) real. The conjugate transpose of the transformed spin matrix must also be Hermitian-and is:
( X ) = ( L X L ) (41.24) = ( L ) ( X ) ( L ) = L X L = X . X = L X L (41.24) = L ( X ) ( L ) = L X L = X . {:[(X^('))^(**)=(LXL^(**))^(**)],[(41.24)=(L^(**))^(**)(X)^(**)(L)^(**)=LXL^(**)=X^(').]:}\begin{align*} \left(X^{\prime}\right)^{*} & =\left(L X L^{*}\right)^{*} \\ & =\left(L^{*}\right)^{*}(X)^{*}(L)^{*}=L X L^{*}=X^{\prime} . \tag{41.24} \end{align*}(X)=(LXL)(41.24)=(L)(X)(L)=LXL=X.
Therefore the new coordinates ( t , x , y , z ) t , x , y , z (t^('),x^('),y^('),z^('))\left(t^{\prime}, x^{\prime}, y^{\prime}, z^{\prime}\right)(t,x,y,z) are guaranteed to be real, as desired. This reality requirement is a rationale for the form of the spin-matrix transformation (41.20), with L L LLL appearing on one side of X X XXX and L L L^(**)L^{*}L on the other.
A Lorentz transformation is defined by the circumstance that it leaves the interval invariant:
(41.25) t 2 x 2 y 2 z 2 = t 2 x 2 y 2 z 2 (41.25) t 2 x 2 y 2 z 2 = t 2 x 2 y 2 z 2 {:(41.25)t^('2)-x^('2)-y^('2)-z^('2)=t^(2)-x^(2)-y^(2)-z^(2):}\begin{equation*} t^{\prime 2}-x^{\prime 2}-y^{\prime 2}-z^{\prime 2}=t^{2}-x^{2}-y^{2}-z^{2} \tag{41.25} \end{equation*}(41.25)t2x2y2z2=t2x2y2z2
Note that the determinant of the matrix X X XXX as written out above has the value
(41.26) det X = t 2 x 2 y 2 z 2 (41.26) det X = t 2 x 2 y 2 z 2 {:(41.26)det X=t^(2)-x^(2)-y^(2)-z^(2):}\begin{equation*} \operatorname{det} X=t^{2}-x^{2}-y^{2}-z^{2} \tag{41.26} \end{equation*}(41.26)detX=t2x2y2z2
Consequently the requirement for the preservation of the interval may be put in the form
(41.27) det X = det X (41.27) det X = det X {:(41.27)detX^(')=det X:}\begin{equation*} \operatorname{det} X^{\prime}=\operatorname{det} X \tag{41.27} \end{equation*}(41.27)detX=detX
or
(41.28) ( det L ) ( det X ) ( det L ) = det X (41.28) ( det L ) ( det X ) det L = det X {:(41.28)(det L)(det X)(detL^(**))=det X:}\begin{equation*} (\operatorname{det} L)(\operatorname{det} X)\left(\operatorname{det} L^{*}\right)=\operatorname{det} X \tag{41.28} \end{equation*}(41.28)(detL)(detX)(detL)=detX
This requirement is fulfilled by demanding
(41.29) det L = 1 (41.29) det L = 1 {:(41.29)det L=1:}\begin{equation*} \operatorname{det} L=1 \tag{41.29} \end{equation*}(41.29)detL=1
[it is not a useful generalization to multiply every element of L L LLL here by a common phase factor e i δ e i δ e^(i delta)e^{i \delta}eiδ, and therefore multiply det L L LLL by e 2 i δ e 2 i δ e^(2i delta)e^{2 i \delta}e2iδ, because the net effect of this phase factor is nil in the formula X = L X L ] X = L X L {:X^(')=LXL^(**)]\left.X^{\prime}=L X L^{*}\right]X=LXL].
Infinitesimal Lorentz transformations
Composition of finite Lorentz transformations from infinitesimal transformations
The spin matrix associated with a rotation, whether finite or infinitesimal, already satisfied the condition det L = 1 det L = 1 det L=1\operatorname{det} L=1detL=1 [proved in exercise (41.2)]. This condition, being algebraic, will continue to hold when the real angles d θ y z , d θ z x , d θ x y d θ y z , d θ z x , d θ x y dtheta_(yz),dtheta_(zx),dtheta_(xy)d \theta_{y z}, d \theta_{z x}, d \theta_{x y}dθyz,dθzx,dθxy, are replaced by complex angles, d θ y z + i d α x , d θ z x + i d α y , d θ x y + i d α z d θ y z + i d α x , d θ z x + i d α y , d θ x y + i d α z dtheta_(yz)+idalpha_(x),dtheta_(zx)+idalpha_(y),dtheta_(xy)+idalpha_(z)d \theta_{y z}+i d \alpha_{x}, d \theta_{z x}+i d \alpha_{y}, d \theta_{x y}+i d \alpha_{z}dθyz+idαx,dθzx+idαy,dθxy+idαz. The spin-transformation matrix acquires in this way a total of six parameters, as needed to describe the general infinitesimal Lorentz transformation. Thus the spin matrix for the general infinitesimal Lorentz transformation can be put in the form
L = 1 ( i / 2 ) ( σ x d θ y z + σ y d θ z x + σ z d θ x y ) (41.30) + ( 1 / 2 ) ( σ x d α x + σ y d α y + σ z d α z ) = 1 ( i d θ / 2 ) ( σ n ) + ( σ d α / 2 ) L = 1 ( i / 2 ) σ x d θ y z + σ y d θ z x + σ z d θ x y (41.30) + ( 1 / 2 ) σ x d α x + σ y d α y + σ z d α z = 1 ( i d θ / 2 ) ( σ n ) + ( σ d α / 2 ) {:[L=1-(i//2)(sigma_(x)dtheta_(yz)+sigma_(y)dtheta_(zx)+sigma_(z)dtheta_(xy))],[(41.30)+(1//2)(sigma_(x)dalpha_(x)+sigma_(y)dalpha_(y)+sigma_(z)dalpha_(z))],[=1-(id theta//2)(sigma*n)+(sigma*d alpha//2)]:}\begin{align*} L= & 1-(i / 2)\left(\sigma_{x} d \theta_{y z}+\sigma_{y} d \theta_{z x}+\sigma_{z} d \theta_{x y}\right) \\ & +(1 / 2)\left(\sigma_{x} d \alpha_{x}+\sigma_{y} d \alpha_{y}+\sigma_{z} d \alpha_{z}\right) \tag{41.30}\\ = & 1-(i d \theta / 2)(\boldsymbol{\sigma} \cdot \boldsymbol{n})+(\boldsymbol{\sigma} \cdot d \boldsymbol{\alpha} / 2) \end{align*}L=1(i/2)(σxdθyz+σydθzx+σzdθxy)(41.30)+(1/2)(σxdαx+σydαy+σzdαz)=1(idθ/2)(σn)+(σdα/2)
The effect of this transformation upon the coordinates is to be read out from the formula
X X = L X L X X = L X L X longrightarrowX^(')=LXL^(**)X \longrightarrow X^{\prime}=L X L^{*}XX=LXL
or
t + ( σ x ) = [ 1 ( i d θ / 2 ) ( σ n ) + ( σ d α / 2 ) ] (41.31) × [ t + ( σ x ) ] [ 1 + ( i d θ / 2 ) ( σ n ) + ( σ d α / 2 ) ] t + σ x = [ 1 ( i d θ / 2 ) ( σ n ) + ( σ d α / 2 ) ] (41.31) × [ t + ( σ x ) ] [ 1 + ( i d θ / 2 ) ( σ n ) + ( σ d α / 2 ) ] {:[t^(')+(sigma*x^('))=[1-(id theta//2)(sigma*n)+(sigma*d alpha//2)]],[(41.31) xx[t+(sigma*x)][1+(id theta//2)(sigma*n)+(sigma*d alpha//2)]]:}\begin{align*} t^{\prime}+\left(\boldsymbol{\sigma} \cdot \boldsymbol{x}^{\prime}\right)= & {[1-(i d \theta / 2)(\boldsymbol{\sigma} \cdot \boldsymbol{n})+(\boldsymbol{\sigma} \cdot d \boldsymbol{\alpha} / 2)] } \\ & \times[t+(\boldsymbol{\sigma} \cdot \boldsymbol{x})][1+(i d \theta / 2)(\boldsymbol{\sigma} \cdot \boldsymbol{n})+(\boldsymbol{\sigma} \cdot d \boldsymbol{\alpha} / 2)] \tag{41.31} \end{align*}t+(σx)=[1(idθ/2)(σn)+(σdα/2)](41.31)×[t+(σx)][1+(idθ/2)(σn)+(σdα/2)]
Employ equation (41.14) for ( σ A ) ( σ B ) ( σ A ) ( σ B ) (sigma*A)(sigma*B)(\boldsymbol{\sigma} \cdot \boldsymbol{A})(\boldsymbol{\sigma} \cdot \boldsymbol{B})(σA)(σB) to reduce the right side to the form
t + ( σ x ) + ( σ d α ) t + d θ ( n × x ) σ + ( x d α ) t + ( σ x ) + ( σ d α ) t + d θ ( n × x ) σ + ( x d α ) t+(sigma*x)+(sigma*d alpha)t+d theta(n xx x)*sigma+(x*d alpha)t+(\boldsymbol{\sigma} \cdot \boldsymbol{x})+(\boldsymbol{\sigma} \cdot d \boldsymbol{\alpha}) t+d \theta(\boldsymbol{n} \times \boldsymbol{x}) \cdot \boldsymbol{\sigma}+(\boldsymbol{x} \cdot d \boldsymbol{\alpha})t+(σx)+(σdα)t+dθ(n×x)σ+(xdα)
Now compare coefficients of 1 , σ x , σ y 1 , σ x , σ y 1,sigma_(x),sigma_(y)1, \boldsymbol{\sigma}_{x}, \boldsymbol{\sigma}_{y}1,σx,σy and σ z σ z sigma_(z)\boldsymbol{\sigma}_{z}σz, respectively, on both sides of the equation, and find
(41.32) t = t + ( x d α ) x = x + t d α + d θ ( n × x ) , (41.32) t = t + ( x d α ) x = x + t d α + d θ ( n × x ) , {:[(41.32)t^(')=t+(x*d alpha)],[x^(')=x+td alpha+d theta(n xx x)","]:}\begin{gather*} t^{\prime}=t+(\boldsymbol{x} \cdot d \boldsymbol{\alpha}) \tag{41.32}\\ \boldsymbol{x}^{\prime}=\boldsymbol{x}+t d \boldsymbol{\alpha}+d \theta(\boldsymbol{n} \times \boldsymbol{x}), \end{gather*}(41.32)t=t+(xdα)x=x+tdα+dθ(n×x),
in agreement with the conventional expression for an infinitesimal Lorentz transformation or "boost" of velocity d α d α d alphad \boldsymbol{\alpha}dα, in active form, as was to be shown.
The composition of such infinitesimal Lorentz transformations gives a finite Lorentz transformation. The result, however, can be calculated easily only when all infinitesimal transformations commute. Thus assume that d θ d θ d thetad \thetadθ and d α d α d alphad \boldsymbol{\alpha}dα are in a fixed ratio, so
ω n d θ d τ and a d α d τ ω n d θ d τ  and  a d α d τ omega-=n(d theta)/(d tau)" and "a-=(d alpha)/(d tau)\omega \equiv \boldsymbol{n} \frac{d \theta}{d \tau} \text { and } \boldsymbol{a} \equiv \frac{d \boldsymbol{\alpha}}{d \tau}ωndθdτ and adαdτ
are constants, with τ τ tau\tauτ a parameter. Then integration with respect to τ τ tau\tauτ (composition of infinitesimal transformations) gives a finite transformation L = exp [ 1 2 i τ σ L = exp 1 2 i τ σ L=exp[-(1)/(2)i tau sigma:}L=\exp \left[-\frac{1}{2} i \tau \boldsymbol{\sigma}\right.L=exp[12iτσ. ( ω + i a ) ] ( ω + i a ) ] (omega+ia)](\omega+i a)](ω+ia)]. For τ = 1 τ = 1 tau=1\tau=1τ=1, so θ n = ω τ , α = a τ θ n = ω τ , α = a τ theta n=omega tau,alpha=a tau\theta \boldsymbol{n}=\boldsymbol{\omega} \tau, \boldsymbol{\alpha}=\boldsymbol{a} \tauθn=ωτ,α=aτ, this reads
(41.33) L = exp [ ( α i θ n ) σ / 2 ] (41.33) L = exp [ ( α i θ n ) σ / 2 ] {:(41.33)L=exp[(alpha-i theta n)*sigma//2]:}\begin{equation*} L=\exp [(\boldsymbol{\alpha}-i \theta \boldsymbol{n}) \cdot \boldsymbol{\sigma} / 2] \tag{41.33} \end{equation*}(41.33)L=exp[(αiθn)σ/2]
In the special case of a pure boost (no rotation; θ = 0 θ = 0 theta=0\theta=0θ=0 ), the exponential function is evaluated along the lines indicated in (41.18), with the result
(41.34) L = cosh ( α / 2 ) + ( n α σ ) sinh ( α / 2 ) (41.34) L = cosh ( α / 2 ) + n α σ sinh ( α / 2 ) {:(41.34)L=cosh(alpha//2)+(n_(alpha)*sigma)sinh(alpha//2):}\begin{equation*} L=\cosh (\alpha / 2)+\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{\sigma}\right) \sinh (\alpha / 2) \tag{41.34} \end{equation*}(41.34)L=cosh(α/2)+(nασ)sinh(α/2)
Here n α = α / α n α = α / α n_(alpha)=alpha//alpha\boldsymbol{n}_{\alpha}=\boldsymbol{\alpha} / \boldsymbol{\alpha}nα=α/α is a unit vector in the direction of the boost. The corresponding Lorentz transformation itself is evaluated from the formula
X = L X L X = L X L X^(')=LXL^(**)X^{\prime}=L X L^{*}X=LXL
or
t + ( x σ ) = [ cosh α / 2 + ( n α σ ) sinh α / 2 ] [ t + ( x σ ) ] (41.35) × [ cosh α / 2 + ( n α σ ) sinh α / 2 ] . t + x σ = cosh α / 2 + n α σ sinh α / 2 [ t + ( x σ ) ] (41.35) × cosh α / 2 + n α σ sinh α / 2 . {:[t^(')+(x^(')*sigma)=[cosh alpha//2+(n_(alpha)*sigma)sinh alpha//2][t+(x*sigma)]],[(41.35) xx[cosh alpha//2+(n_(alpha)*sigma)sinh alpha//2].]:}\begin{align*} t^{\prime}+\left(\boldsymbol{x}^{\prime} \cdot \boldsymbol{\sigma}\right)= & {\left[\cosh \alpha / 2+\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{\sigma}\right) \sinh \alpha / 2\right][t+(\boldsymbol{x} \cdot \boldsymbol{\sigma})] } \\ & \times\left[\cosh \alpha / 2+\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{\sigma}\right) \sinh \alpha / 2\right] . \tag{41.35} \end{align*}t+(xσ)=[coshα/2+(nασ)sinhα/2][t+(xσ)](41.35)×[coshα/2+(nασ)sinhα/2].
Simplify with the help of the relations
cosh 2 ( α / 2 ) + sinh 2 ( α / 2 ) = cosh α 2 sinh ( α / 2 ) cosh ( α / 2 ) = sinh α cosh 2 ( α / 2 ) + sinh 2 ( α / 2 ) = cosh α 2 sinh ( α / 2 ) cosh ( α / 2 ) = sinh α {:[cosh^(2)(alpha//2)+sinh^(2)(alpha//2)=cosh alpha],[2sinh(alpha//2)cosh(alpha//2)=sinh alpha]:}\begin{gathered} \cosh ^{2}(\alpha / 2)+\sinh ^{2}(\alpha / 2)=\cosh \alpha \\ 2 \sinh (\alpha / 2) \cosh (\alpha / 2)=\sinh \alpha \end{gathered}cosh2(α/2)+sinh2(α/2)=coshα2sinh(α/2)cosh(α/2)=sinhα
and
( n α σ ) ( x σ ) ( n α σ ) = 2 ( n α x ) ( n α σ ) ( n α n α ) ( x σ ) , n α σ ( x σ ) n α σ = 2 n α x n α σ n α n α ( x σ ) , (n_(alpha)*sigma)(x*sigma)(n_(alpha)*sigma)=2(n_(alpha)*x)(n_(alpha)*sigma)-(n_(alpha)*n_(alpha))(x*sigma),\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{\sigma}\right)(\boldsymbol{x} \cdot \boldsymbol{\sigma})\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{\sigma}\right)=2\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{x}\right)\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{\sigma}\right)-\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{n}_{\alpha}\right)(\boldsymbol{x} \cdot \boldsymbol{\sigma}),(nασ)(xσ)(nασ)=2(nαx)(nασ)(nαnα)(xσ),
and on both sides of the equation compare coefficients of 1 and σ σ sigma\boldsymbol{\sigma}σ, to find
t = ( cosh α ) t + ( sinh α ) ( n α x ) , (41.36) x = [ ( sinh α ) n α t + ( cosh α ) ( n α x ) n α ] ( "in-line part of transformation") + [ x ( x n α ) n α ] ( "perpendicular part of x unchanged"). t = ( cosh α ) t + ( sinh α ) n α x , (41.36) x = ( sinh α ) n α t + ( cosh α ) n α x n α (  "in-line part of transformation")  + x x n α n α (  "perpendicular part of  x  unchanged").  {:[t^(')=(cosh alpha)t+(sinh alpha)(n_(alpha)*x)","],[(41.36)x^(')=[(sinh alpha)n_(alpha)t+(cosh alpha)(n_(alpha)*x)n_(alpha)](" "in-line part of transformation") "],[+[x-(x*n_(alpha))n_(alpha)](" "perpendicular part of "x" unchanged"). "]:}\begin{align*} t^{\prime} & =(\cosh \alpha) t+(\sinh \alpha)\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{x}\right), \\ \boldsymbol{x}^{\prime} & =\left[(\sinh \alpha) \boldsymbol{n}_{\alpha} t+(\cosh \alpha)\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{x}\right) \boldsymbol{n}_{\alpha}\right](\text { "in-line part of transformation") } \tag{41.36}\\ & +\left[\boldsymbol{x}-\left(\boldsymbol{x} \cdot \boldsymbol{n}_{\alpha}\right) \boldsymbol{n}_{\alpha}\right](\text { "perpendicular part of } \boldsymbol{x} \text { unchanged"). } \end{align*}t=(coshα)t+(sinhα)(nαx),(41.36)x=[(sinhα)nαt+(coshα)(nαx)nα]( "in-line part of transformation") +[x(xnα)nα]( "perpendicular part of x unchanged"). 
In this way one verifies that the quantity α α alpha\alphaα is the usual "velocity parameter," connected with the velocity itself by the relations
( 1 β 2 ) 1 / 2 = cosh α (41.37) β ( 1 β 2 ) 1 / 2 = sinh α β = tanh α . 1 β 2 1 / 2 = cosh α (41.37) β 1 β 2 1 / 2 = sinh α β = tanh α . {:[(1-beta^(2))^(-1//2)=cosh alpha],[(41.37)beta(1-beta^(2))^(-1//2)=sinh alpha],[beta=tanh alpha.]:}\begin{align*} \left(1-\beta^{2}\right)^{-1 / 2} & =\cosh \alpha \\ \beta\left(1-\beta^{2}\right)^{-1 / 2} & =\sinh \alpha \tag{41.37}\\ \beta & =\tanh \alpha . \end{align*}(1β2)1/2=coshα(41.37)β(1β2)1/2=sinhαβ=tanhα.
That velocity parameters add for successive boosts in the same direction shows nowhere more clearly than in the representation (41.33) of the spin-transformation matrix:
L ( α 2 ) L ( α 1 ) = exp [ α 2 ( n α σ ) / 2 ] exp [ α 1 ( n α σ ) / 2 ] = exp [ ( α 2 + α 1 ) ( n α σ ) / 2 ] (41.38) = L ( α 2 + α 1 ) . L α 2 L α 1 = exp α 2 n α σ / 2 exp α 1 n α σ / 2 = exp α 2 + α 1 n α σ / 2 (41.38) = L α 2 + α 1 . {:[L(alpha_(2))L(alpha_(1))=exp[alpha_(2)(n_(alpha)*sigma)//2]exp[alpha_(1)(n_(alpha)*sigma)//2]=exp[(alpha_(2)+alpha_(1))(n_(alpha)*sigma)//2]],[(41.38)=L(alpha_(2)+alpha_(1)).]:}\begin{align*} L\left(\alpha_{2}\right) L\left(\alpha_{1}\right) & =\exp \left[\alpha_{2}\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{\sigma}\right) / 2\right] \exp \left[\alpha_{1}\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{\sigma}\right) / 2\right]=\exp \left[\left(\alpha_{2}+\alpha_{1}\right)\left(\boldsymbol{n}_{\alpha} \cdot \sigma\right) / 2\right] \\ & =L\left(\alpha_{2}+\alpha_{1}\right) . \tag{41.38} \end{align*}L(α2)L(α1)=exp[α2(nασ)/2]exp[α1(nασ)/2]=exp[(α2+α1)(nασ)/2](41.38)=L(α2+α1).
Turn from this special case, and ask how to get the resultant of two arbitrary Lorentz transformations, each of which is a mixture of a rotation and a boost. No simpler method offers itself to answer this question than to use formula (41.33) together with the equation
(41.39) L ( resultant ) = L 2 L 1 . (41.39) L (  resultant  ) = L 2 L 1 . {:(41.39)L(" resultant ")=L_(2)L_(1).:}\begin{equation*} L(\text { resultant })=L_{2} L_{1} . \tag{41.39} \end{equation*}(41.39)L( resultant )=L2L1.

§41.4. THOMAS PRECESSION VIA SPINOR ALGEBRA

A spinning object, free of all torque, but undergoing acceleration, changes its direction as this direction is recorded in an inertial frame of reference. This is the
Thomas precession [see exercise 6.9 and first term in equation (40.33b)]. This precession accounts for a factor two in the effective energy of coupling of spin and orbital angular momentum of an atomic electron. In a nucleus it contributes a little to the coupling of the spin and orbit of a nucleon. The evaluation of the Thomas precession affords an illustration of spin-matrix methods in action.
The precession in question can be discussed quite without reference either to angular momentum or to mass in motion. It is enough to consider a sequence of inertial frames of reference S ( t ) S ( t ) S(t)S(t)S(t) with these two features. (1) To whatever point the motion has taken the mass at time t t ttt, at that point is located the origin of the frame S ( t ) S ( t ) S(t)S(t)S(t). (2) The inertial frame S ( t + d t ) S ( t + d t ) S(t+dt)S(t+d t)S(t+dt) at the next succeeding moment has undergone no rotation with respect to the inertial frame S ( t ) S ( t ) S(t)S(t)S(t), as rotation is conceived by an observer in that inertial frame. However, it has undergone a rotation ("Thomas precession") as rotation is conceived and defined in the laboratory frame of reference.
How is it possible for "no rotation" to appear as "rotation"? The answer is this: one pure boost, followed by another pure boost in another direction, does not have as net result a third pure boost; instead, the net result is a boost plus a rotation. This idea is not new in kind. Figure 41.1 illustrated how a rotation about the z z zzz-axis followed by a rotation about the x x xxx-axis had as resultant a rotation about an axis with not only an x x xxx-component and a z z zzz-component but also a y y yyy-component. What is true of rotations is true of boosts: they defy the law for the addition of vectors.
Let the frame S 0 S 0 S_(0)S_{0}S0 coincide with the laboratory frame, and let the origin of this laboratory frame be where the moving frame is at time t t ttt. Let S ( t ) S ( t ) S(t)S(t)S(t) be a Lorentz frame moving with this point at time t t ttt. Let one pure boost raise its velocity relative to the laboratory from β β beta\boldsymbol{\beta}β to β + d β β + d β beta+d beta\boldsymbol{\beta}+d \boldsymbol{\beta}β+dβ. The resulting final configuration cannot be reached from S 0 S 0 S_(0)S_{0}S0 by a pure boost. Instead, first turn S 0 S 0 S_(0)S_{0}S0 relative to the laboratory frame ("rotation R R RRR associated with the Thomas precession") and then send it by a simple boost to the final configuration. Only one choice of this rotation will be right to produce match-up. Thus, distinguishing the spin matrices for pure boosts and pure rotations by the letters B B BBB and R R RRR, one has the relation
(41.40) B ( β + d β ) R ( ω d t ) = " B ( d β ) " B ( β ) (41.40) B ( β + d β ) R ( ω d t ) = " B ( d β ) " B ( β ) {:(41.40)B(beta+d beta)R(omega dt)="B(d beta)"B(beta):}\begin{equation*} B(\boldsymbol{\beta}+d \boldsymbol{\beta}) R(\boldsymbol{\omega} d t)=" B(d \boldsymbol{\beta}) " B(\boldsymbol{\beta}) \tag{41.40} \end{equation*}(41.40)B(β+dβ)R(ωdt)="B(dβ)"B(β)
out of which to find the angular velocity ω ω omega\omegaω of the Thomas precession. The quotation marks in " B ( d β ) B ( d β ) B(d beta)B(d \boldsymbol{\beta})B(dβ) " carry a double warning: (1) the velocity of transformation that boosts S ( t ) S ( t ) S(t)S(t)S(t) to S ( t + d t ) S ( t + d t ) S(t+dt)S(t+d t)S(t+dt) is not ( β + d β ) β = d β ( β + d β ) β = d β (beta+d beta)-beta=d beta(\boldsymbol{\beta}+d \boldsymbol{\beta})-\boldsymbol{\beta}=d \boldsymbol{\beta}(β+dβ)β=dβ (law of vector addition-or subtraction-not applicable to velocity), and (2) " B ( d β ) B ( d β ) B(d beta)B(d \boldsymbol{\beta})B(dβ) " does not appear as a pure boost in the laboratory frame. It appears as a pure boost only in the comoving frame.
Take care of the second difficulty first. It is only a difficulty because the formalism for combination of transformations, R 3 = R 2 R 1 R 3 = R 2 R 1 R_(3)=R_(2)R_(1)R_{3}=R_{2} R_{1}R3=R2R1, as developed in § 41.1 § 41.1 §41.1\S 41.1§41.1 presupposes all operations R 1 , R 2 , R 1 , R 2 , R_(1),R_(2),dotsR_{1}, R_{2}, \ldotsR1,R2,, to be defined and carried out in the laboratory reference frame. In contrast, the quantity " B ( d β ) B ( d β ) B(d beta)B(d \boldsymbol{\beta})B(dβ) " is understood to imply a pure boost as defined and carried out in the comoving frame. Such an operation can be fitted into the formalism as follows. (1) Undo any velocity that the object already has. In other words apply the operator B 1 ( β ) B 1 ( β ) B^(-1)(beta)B^{-1}(\boldsymbol{\beta})B1(β). Then the object is at rest in the laboratory frame. Then apply the necessary small pure boost, B ( a comoving d τ ) B a comoving  d τ B(a_("comoving ")d tau)B\left(\boldsymbol{a}_{\text {comoving }} d \tau\right)B(acomoving dτ), where a comoving a comoving  a_("comoving ")\boldsymbol{a}_{\text {comoving }}acomoving 
is the acceleration as it will be sensed by the object and d τ d τ d taud \taudτ is the lapse of proper time as it will be sensed by the object. At the commencement of this brief acceleration the object is at rest relative to the laboratory. What is a pure boost to it is a pure boost relative to the laboratory. It is also a pure boost in the spin-matrix formalism. Then transform back from laboratory to moving frame. Thus have the relation
(41.41) " B ( d β ) " = B ( β ) B ( a comoving d τ ) B 1 ( β ) (41.41) " B ( d β ) " = B ( β ) B a comoving  d τ B 1 ( β ) {:(41.41)"B(d beta)"=B(beta)B(a_("comoving ")d tau)B^(-1)(beta):}\begin{equation*} " B(d \boldsymbol{\beta}) "=B(\beta) B\left(\boldsymbol{a}_{\text {comoving }} d \tau\right) B^{-1}(\boldsymbol{\beta}) \tag{41.41} \end{equation*}(41.41)"B(dβ)"=B(β)B(acomoving dτ)B1(β)
The equation for the determination of the Thomas precession now reads
(41.42) B ( β + d β ) R ( ω d t ) = B ( β ) B ( a comoving d τ ) (41.42) B ( β + d β ) R ( ω d t ) = B ( β ) B a comoving  d τ {:(41.42)B(beta+d beta)R(omega dt)=B(beta)B(a_("comoving ")d tau):}\begin{equation*} B(\boldsymbol{\beta}+d \boldsymbol{\beta}) R(\boldsymbol{\omega} d t)=B(\boldsymbol{\beta}) B\left(\boldsymbol{a}_{\text {comoving }} d \tau\right) \tag{41.42} \end{equation*}(41.42)B(β+dβ)R(ωdt)=B(β)B(acomoving dτ)
or, with all unknowns put on the left,
(41.43) R ( ω d t ) B 1 ( a comoving d τ ) = B 1 ( β + d β ) B ( β ) . (41.43) R ( ω d t ) B 1 a comoving  d τ = B 1 ( β + d β ) B ( β ) . {:(41.43)R(omega dt)B^(-1)(a_("comoving ")d tau)=B^(-1)(beta+d beta)B(beta).:}\begin{equation*} R(\boldsymbol{\omega} d t) B^{-1}\left(\boldsymbol{a}_{\text {comoving }} d \tau\right)=B^{-1}(\boldsymbol{\beta}+d \boldsymbol{\beta}) B(\boldsymbol{\beta}) . \tag{41.43} \end{equation*}(41.43)R(ωdt)B1(acomoving dτ)=B1(β+dβ)B(β).
The first task, to replace the erroneous value of the velocity change ( d β ) ( d β ) (d beta)(d \boldsymbol{\beta})(dβ) by a correct value ( a comoving d τ a comoving  d τ a_("comoving ")d tau\boldsymbol{a}_{\text {comoving }} d \tauacomoving dτ ), is now made part of the problem along with the evaluation of the Thomas precession itself.
Principles settled, the calculation proceeds by inserting the appropriate expressions for all four factors in (41.43), and evaluating both sides of the equation to the first order of small quantities, as follows:
1 ( i d t ω + d τ a ) σ / 2 = [ cosh ( α / 2 ) ( n α σ ) sinh ( α / 2 ) ] (41.44) × [ cosh ( α / 2 ) + ( n α σ ) sinh ( α / 2 ) ] . 1 ( i d t ω + d τ a ) σ / 2 = cosh α / 2 n α σ sinh α / 2 (41.44) × cosh ( α / 2 ) + n α σ sinh ( α / 2 ) . {:[1-(idt omega+d tau a)*sigma//2=[cosh(alpha^(')//2)-(n_(alpha^('))*sigma)sinh(alpha^(')//2)]],[(41.44) xx[cosh(alpha//2)+(n_(alpha)*sigma)sinh(alpha//2)].]:}\begin{align*} 1-(i d t \boldsymbol{\omega}+d \tau \boldsymbol{a}) \cdot \boldsymbol{\sigma} / 2= & {\left[\cosh \left(\alpha^{\prime} / 2\right)-\left(\boldsymbol{n}_{\alpha^{\prime}} \cdot \boldsymbol{\sigma}\right) \sinh \left(\alpha^{\prime} / 2\right)\right] } \\ & \times\left[\cosh (\alpha / 2)+\left(\boldsymbol{n}_{\alpha} \cdot \boldsymbol{\sigma}\right) \sinh (\alpha / 2)\right] . \tag{41.44} \end{align*}1(idtω+dτa)σ/2=[cosh(α/2)(nασ)sinh(α/2)](41.44)×[cosh(α/2)+(nασ)sinh(α/2)].
Here α α alpha\alphaα and n α n α n_(alpha)\boldsymbol{n}_{\alpha}nα are the velocity parameter and unit vector that go with the velocity β ; α = α + d α β ; α = α + d α beta;alpha^(')=alpha+d alpha\boldsymbol{\beta} ; \alpha^{\prime}=\alpha+d \alphaβ;α=α+dα, and n α = n α + d n α n α = n α + d n α n_(alpha^('))=n_(alpha)+dn_(alpha)\boldsymbol{n}_{\alpha^{\prime}}=\boldsymbol{n}_{\alpha}+d \boldsymbol{n}_{\alpha}nα=nα+dnα, go with β + d β β + d β beta+d beta\boldsymbol{\beta}+d \boldsymbol{\beta}β+dβ. Develop the righthand side of (41.44) by the methods of calculus, writing α = α + d α α = α + d α alpha^(')=alpha+d alpha\alpha^{\prime}=\alpha+d \alphaα=α+dα and n α = n α + d n α n α = n α + d n α n_(alpha^('))=n_(alpha)+dn_(alpha)\boldsymbol{n}_{\alpha^{\prime}}=\boldsymbol{n}_{\alpha}+d \boldsymbol{n}_{\alpha}nα=nα+dnα, and applying the rule for the differentiation of a product. Equate coefficients of σ / 2 σ / 2 -sigma//2-\boldsymbol{\sigma} / 2σ/2 and i σ / 2 i σ / 2 -i sigma//2-i \boldsymbol{\sigma} / 2iσ/2 on both sides of the equation. Thus find
(41.45) a comoving d τ = ( d α ) n α + ( sinh α ) d n α (41.45) a comoving  d τ = ( d α ) n α + ( sinh α ) d n α {:(41.45)a_("comoving ")d tau=(d alpha)n_(alpha)+(sinh alpha)dn_(alpha):}\begin{equation*} \boldsymbol{a}_{\text {comoving }} d \tau=(d \alpha) \boldsymbol{n}_{\alpha}+(\sinh \alpha) d \boldsymbol{n}_{\alpha} \tag{41.45} \end{equation*}(41.45)acomoving dτ=(dα)nα+(sinhα)dnα
and
(41.46) ω d t = [ 2 sinh 2 ( α / 2 ) ] d n α × n α . (41.46) ω d t = 2 sinh 2 ( α / 2 ) d n α × n α . {:(41.46)omega dt=[2sinh^(2)(alpha//2)]dn_(alpha)xxn_(alpha).:}\begin{equation*} \boldsymbol{\omega} d t=\left[2 \sinh ^{2}(\alpha / 2)\right] d \boldsymbol{n}_{\alpha} \times \boldsymbol{n}_{\alpha} . \tag{41.46} \end{equation*}(41.46)ωdt=[2sinh2(α/2)]dnα×nα.
The one expression gives the change of velocity as seen in a comoving inertial frame. The other gives the precession as seen in the laboratory frame. For low velocities the expression for the Thomas precession reduces to
(41.47) ω = a × β / 2 . (41.47) ω = a × β / 2 . {:(41.47)omega=a xx beta//2.:}\begin{equation*} \omega=a \times \beta / 2 . \tag{41.47} \end{equation*}(41.47)ω=a×β/2.
Here a a a\boldsymbol{a}a is the acceleration. Only the component perpendicular to the velocity β β beta\boldsymbol{\beta}β is relevant for the precession.
For an elementary account of the importance of the Thomas precession in atomic physics, see, for example, Ruark and Urey (1930).
Angular velocity of Thomas precession
"Orientation-entanglement relation" between a cube and the walls of a room. A 360 360 360^(@)360^{\circ}360 rotation of the cube entangles the threads. A 720 720 720^(@)720^{\circ}720 rotation might be thought to entangle them still more-but instead makes it possible completely to disen-

§41.5. SPINORS

Orientation-entanglement relation
Figure 41.5. tangle them.
Paint each face of a cube a different color. Then connect each corner of the cube to the corresponding corner of the room with an elastic thread (Figure 41.5). Now rotate the cube through 2 π = 360 2 π = 360 2pi=360^(@)2 \pi=360^{\circ}2π=360. The threads become tangled. Nothing one can do will untangle them. It is impossible for every thread to proceed on its way in a straight line. Now rotate the cube about the same axis by a further 2 π 2 π 2pi2 \pi2π. The threads become still more tangled. However, a little work now completely straightens out the tangle (Figure 41.6). Every thread runs as it did in the beginning in a straight line from its corner of the cube to the corresponding corner of the room. More generally, rotations by 0 , ± 4 π , ± 8 π , 0 , ± 4 π , ± 8 π , 0,+-4pi,+-8pi,dots0, \pm 4 \pi, \pm 8 \pi, \ldots0,±4π,±8π,, leave the cube in its standard "orienta-tion-entanglement relation" with its surroundings, whereas rotations by ± 2 π , ± 6 π ± 2 π , ± 6 π +-2pi,+-6pi\pm 2 \pi, \pm 6 \pi±2π,±6π, ± 10 π , ± 10 π , +-10 pi,dots\pm 10 \pi, \ldots±10π,, restore to the cube only its orientation, not its orientation-entanglement relation with its surroundings. Evidently there is something about the geometry of orientation that is not fully taken into account in the usual concept of orientation; hence the concept of "orientation-entanglement relation" or (briefer term!) "version" (Latin versor, turn). Whether there is also a detectable difference in the physics (contact potential between a metallic object and its metallic surroundings, for example) for two inequivalent versions of an object is not known [Aharonov and Susskind (1967)].
In keeping with the distinction between the two inequivalent versions of an object, the spin matrix associated with a rotation,
(41.48) R = cos ( θ / 2 ) i ( n σ ) sin ( θ / 2 ) , (41.48) R = cos ( θ / 2 ) i ( n σ ) sin ( θ / 2 ) , {:(41.48)R=cos(theta//2)-i(n*sigma)sin(theta//2)",":}\begin{equation*} R=\cos (\theta / 2)-i(\boldsymbol{n} \cdot \boldsymbol{\sigma}) \sin (\theta / 2), \tag{41.48} \end{equation*}(41.48)R=cos(θ/2)i(nσ)sin(θ/2),
reverses sign on a rotation through an odd multiple of 2 π 2 π 2pi2 \pi2π. This sign change never shows up in the law of transformation of a vector, as summarized in the formula
(41.49) X X = R X R (41.49) X X = R X R {:(41.49)X longrightarrowX^(')=RXR^(**):}\begin{equation*} X \longrightarrow X^{\prime}=R X R^{*} \tag{41.49} \end{equation*}(41.49)XX=RXR
(two factors R R RRR; sign change in each!). The sign change does show up when one turns from a vector to a 2 -component quantity that transforms according to the law
(41.50) ξ ξ = R ξ . (41.50) ξ ξ = R ξ . {:(41.50)xi longrightarrowxi^(')=R xi.:}\begin{equation*} \xi \longrightarrow \xi^{\prime}=R \xi . \tag{41.50} \end{equation*}(41.50)ξξ=Rξ.
Figure 41.6.
An object is connected to its surroundings by elastic threads as in Figure 41.5. (Eight are shown here; any number could be used.) Rotating the object through 720 720 720^(@)720^{\circ}720 and then following the procedure outlined (Edward McDonald) in frames 2-8 (with the object remaining fixed), one finds that the connecting threads are left disentangled, as in frame 9 (lower right).
Such a quantity is known as a spinor. A spinor reverses sign on a 360 360 360^(@)360^{\circ}360 rotation. It therefore provides a reasonable means to keep track of the difference between the two inequivalent versions of the cube. More generally, with each orientationentanglement relation between the cube and its surroundings one can associate a different value of the spinor ξ ξ xi\xiξ. Moreover, there is nothing that limits the usefulness of the spinor concept to rotations. Also, for the general combination of boost and rotation, one can write
(41.51) ξ ξ = L ξ (41.51) ξ ξ = L ξ {:(41.51)xi longrightarrowxi^(')=L xi:}\begin{equation*} \xi \longrightarrow \xi^{\prime}=L \xi \tag{41.51} \end{equation*}(41.51)ξξ=Lξ
Lorentz transformation of a spinor
When the boost and rotation are both infinitesimal, the explicit form of this transformation is simple:
ξ = [ 1 ( i d θ / 2 ) ( n σ ) + ( d β / 2 ) σ ] ξ , ξ = [ 1 ( i d θ / 2 ) ( n σ ) + ( d β / 2 ) σ ] ξ , xi^(')=[1-(id theta//2)(n*sigma)+(d beta//2)*sigma]xi,\xi^{\prime}=[1-(i d \theta / 2)(\boldsymbol{n} \cdot \boldsymbol{\sigma})+(d \boldsymbol{\beta} / 2) \cdot \boldsymbol{\sigma}] \xi,ξ=[1(idθ/2)(nσ)+(dβ/2)σ]ξ,
or, according to (41.1),
(41.52) ( ξ ξ 2 ) = 1 + 1 2 ( i θ x y + β z ) 1 2 ( i θ y z θ z x + β x i β y ) ( ξ 1 1 2 ( i θ y z + θ z x + β x + i β y ) ξ 2 ) , 1 2 ( i θ x y β z ) (41.52) ( ξ ξ 2 ) = 1 + 1 2 i θ x y + β z 1 2 i θ y z θ z x + β x i β y ξ 1 1 2 i θ y z + θ z x + β x + i β y ξ 2 , 1 2 i θ x y β z {:(41.52)((xi^('))/(xi^('2)))=||[1+(1)/(2)(-itheta_(xy)+beta_(z)),(1)/(2)(-itheta_(yz)-theta_(zx)+beta_(x)-ibeta_(y))||([xi^(1)],[(1)/(2)(-itheta_(yz)+theta_(zx)+beta_(x)+ibeta_(y))],[],[xi^(2)])","(1)/(2)(itheta_(xy)-beta_(z))]||:}\binom{\xi^{\prime}}{\xi^{\prime 2}}=\left\|\begin{array}{ll} 1+\frac{1}{2}\left(-i \theta_{x y}+\beta_{z}\right) & \frac{1}{2}\left(-i \theta_{y z}-\theta_{z x}+\beta_{x}-i \beta_{y}\right) \|\left(\begin{array}{l} \xi^{1} \\ \frac{1}{2}\left(-i \theta_{y z}+\theta_{z x}+\beta_{x}+i \beta_{y}\right) \\ \\ \xi^{2} \end{array}\right), \frac{1}{2}\left(i \theta_{x y}-\beta_{z}\right) \tag{41.52} \end{array}\right\|(41.52)(ξξ2)=1+12(iθxy+βz)12(iθyzθzx+βxiβy)(ξ112(iθyz+θzx+βx+iβy)ξ2),12(iθxyβz)
For any combination of a boost in the z z zzz-direction of any magnitude and a finite rotation about the z z zzz-axis, one has
(41.53) ( ξ ξ 2 ) = e 1 / 2 i θ x ν + 1 / 2 β z 0 0 e 1 / 2 i θ x y 1 / 2 β z ( ξ 1 ξ 2 ) . (41.53) ( ξ ξ 2 ) = e 1 / 2 i θ x ν + 1 / 2 β z 0 0 e 1 / 2 i θ x y 1 / 2 β z ( ξ 1 ξ 2 ) . {:(41.53)((xi^(''))/(xi^('2)))=||[e^(-1//2itheta_(x nu)+1//2beta_(z)),0],[0,e^(1//2itheta_(xy)-1//2beta_(z))]||((xi^(1))/(xi^(2))).:}\binom{\xi^{\prime \prime}}{\xi^{\prime 2}}=\left\|\begin{array}{cc} e^{-1 / 2 i \theta_{x \nu}+1 / 2 \beta_{z}} & 0 \tag{41.53}\\ 0 & e^{1 / 2 i \theta_{x y}-1 / 2 \beta_{z}} \end{array}\right\|\binom{\xi^{1}}{\xi^{2}} .(41.53)(ξξ2)=e1/2iθxν+1/2βz00e1/2iθxy1/2βz(ξ1ξ2).
To keep track of the two components of the spinor, it is convenient and customary to introduce a label (capital Roman letter near beginning of alphabet) that takes on the values 1 and 2 ; thus (41.51) becomes
(41.54) ξ A = L A B ξ B . (41.54) ξ A = L A B ξ B . {:(41.54)xi^('A)=L^(A)_(B)xi^(B).:}\begin{equation*} \xi^{\prime A}=L^{A}{ }_{B} \xi^{B} . \tag{41.54} \end{equation*}(41.54)ξA=LABξB.
The spinor has acquired a significance of its own through one's having pulled out half of the transformation formula
(41.55) X = L X L (41.55) X = L X L {:(41.55)X^(')=LXL^(**):}\begin{equation*} X^{\prime}=L X L^{*} \tag{41.55} \end{equation*}(41.55)X=LXL
To be able to recover this formula, one requires the other half as well. It contains the conjugate complex of the Lorentz transformation. Therefore introduce another
Second type of spinor
Vector regarded as a Hermitian second-rank spinor spinor η η eta\etaη that transforms according to the law
(41.56) η U ˙ = L ¯ U ˙ U ˙ η V ˙ (41.56) η U ˙ = L ¯ U ˙ U ˙ η V ˙ {:(41.56)eta^('U^(˙))= bar(L)_(U^(˙))^(U^(˙))eta^(V^(˙)):}\begin{equation*} \eta^{\prime \dot{U}}=\bar{L}_{\dot{U}}^{\dot{U}} \eta^{\dot{V}} \tag{41.56} \end{equation*}(41.56)ηU˙=L¯U˙U˙ηV˙
[ U ˙ = 1 ˙ , 2 ˙ ; V ˙ = 1 ˙ , 2 ; [ U ˙ = 1 ˙ , 2 ˙ ; V ˙ = 1 ˙ , 2 ; [U^(˙)=1^(˙),2^(˙);V^(˙)=1^(˙),2;[\dot{U}=\dot{1}, \dot{2} ; \dot{V}=\dot{1}, 2 ;[U˙=1˙,2˙;V˙=1˙,2; dots and capital letters near the end of the alphabet are used to distinguish components that transform according to the conjugate complex (no transpose!) of the Lorentz spin matrix].

§41.6. CORRESPONDENCE BETWEEN VECTORS AND SPINORS

To go back from spinors to vectors, note that the spin matrix X X XXX in (41.55) has the form
(41.57) X = t + ( x σ ) = ( t + z ) ( x i y ) ( x + i y ) ( t z ) = X 1 i X 1 1 ˙ X 2 i X 2 2 ˙ , (41.57) X = t + ( x σ ) = ( t + z ) ( x i y ) ( x + i y ) ( t z ) = X 1 i X 1 1 ˙ X 2 i X 2 2 ˙ , {:(41.57)X=t+(x*sigma)=||[(t+z),(x-iy)],[(x+iy),(t-z)]||=||[X^(1i),X^(11^(˙))],[X^(2i),X^(22^(˙))]||",":}X=t+(x \cdot \boldsymbol{\sigma})=\left\|\begin{array}{cc} (t+z) & (x-i y) \tag{41.57}\\ (x+i y) & (t-z) \end{array}\right\|=\left\|\begin{array}{ll} X^{1 \mathrm{i}} & X^{1 \dot{1}} \\ X^{2 \mathrm{i}} & X^{2 \dot{2}} \end{array}\right\|,(41.57)X=t+(xσ)=(t+z)(xiy)(x+iy)(tz)=X1iX11˙X2iX22˙,
where the labels receive dots or no dots according as they are coupled in (41.55) to L L L^(**)L^{*}L or to L L LLL. That equation of transformation becomes
(41.58) X A U ˙ = L A B L ¯ U ˙ V ˙ X B V ˙ (41.58) X A U ˙ = L A B L ¯ U ˙ V ˙ X B V ˙ {:(41.58)X^('AU^(˙))=L^(A)_(B) bar(L)^(U^(˙))V^(˙)X^(BV^(˙)):}\begin{equation*} X^{\prime A \dot{U}}=L^{A}{ }_{B} \bar{L}^{\dot{U}} \dot{V} X^{B \dot{V}} \tag{41.58} \end{equation*}(41.58)XAU˙=LABL¯U˙V˙XBV˙
(transpose obtained automatically by ordering of indices; thus L ¯ U ˙ V ˙ ˙ L ¯ U ˙ V ˙ ˙ bar(L)^(U^(˙))V^(˙)^(˙)\bar{L}^{\dot{U}} \dot{\dot{V}}L¯U˙V˙˙, not L U ˙ V ˙ ˙ L U ˙ V ˙ ˙ L^(**U^(˙))V^(˙)^(˙)L^{* \dot{U}} \dot{\dot{V}}LU˙V˙˙ ). The coefficients in this transformation are identical with the coefficients in the law for the transformation of a "second-rank spinor with one index undotted and the other dotted:"
(41.59) ξ A η U ˙ = L A B L ¯ U ˙ V ˙ ˙ ξ ˙ β η η ˙ (41.59) ξ A η U ˙ = L A B L ¯ U ˙ V ˙ ˙ ξ ˙ β η η ˙ {:(41.59)xi^('A)eta^('U^(˙))=L^(A)_(B) bar(L)^(U^(˙))V^(˙)^(˙)xi^(˙)^(beta)eta^(eta^(˙)):}\begin{equation*} \xi^{\prime A} \eta^{\prime \dot{U}}=L^{A}{ }_{B} \bar{L}^{\dot{U}} \dot{\dot{V}} \dot{\xi}^{\beta} \eta^{\dot{\eta}} \tag{41.59} \end{equation*}(41.59)ξAηU˙=LABL¯U˙V˙˙ξ˙βηη˙
In this sense one can say that "a 4 -vector transforms like a second-rank spinor." To be completely explicit about this connection between a 4 -vector and a second-rank spinor, note from (41.57) the relations
X 1 1 ˙ = x 0 + x 3 , X 12 = x 1 i x 2 , (41.60) X 2 1 ˙ = x 1 + i x 2 , X 2 2 ˙ = x 0 x 3 . X 1 1 ˙ = x 0 + x 3 , X 12 = x 1 i x 2 , (41.60) X 2 1 ˙ = x 1 + i x 2 , X 2 2 ˙ = x 0 x 3 . {:[X^(11^(˙))=x^(0)+x^(3)","],[X^(12)=x^(1)-ix^(2)","],[(41.60)X^(21^(˙))=x^(1)+ix^(2)","],[X^(22^(˙))=x^(0)-x^(3).]:}\begin{align*} & X^{1 \dot{1}}=x^{0}+x^{3}, \\ & X^{12}=x^{1}-i x^{2}, \\ & X^{2 \dot{1}}=x^{1}+i x^{2}, \tag{41.60}\\ & X^{2 \dot{2}}=x^{0}-x^{3} . \end{align*}X11˙=x0+x3,X12=x1ix2,(41.60)X21˙=x1+ix2,X22˙=x0x3.
In a more compact form, one has
(41.61) X A U ˙ = [ t + ( x σ ) ] A U ˙ = x μ σ μ A U ˙ (41.61) X A U ˙ = [ t + ( x σ ) ] A U ˙ = x μ σ μ A U ˙ {:(41.61)X^(AU^(˙))=[t+(x*sigma)]^(AU^(˙))=x^(mu)sigma_(mu)^(AU^(˙)):}\begin{equation*} X^{A \dot{U}}=[t+(\boldsymbol{x} \cdot \boldsymbol{\sigma})]^{A \dot{U}}=x^{\mu} \boldsymbol{\sigma}_{\mu}{ }^{A \dot{U}} \tag{41.61} \end{equation*}(41.61)XAU˙=[t+(xσ)]AU˙=xμσμAU˙
where σ 0 σ 0 sigma_(0)\sigma_{0}σ0 is the unit matrix. This equation tells immediately how to go from the components of a 4 -vector, or " 1 -index tensor," to the components of the corresponding " 1,1 -spinor" (one undotted and one dotted index).
With each real 4 -vector x α x α x^(alpha)x^{\alpha}xα is associated a 1,1 -spinor that is Hermitian in the sense that
(41.62) X A U ˙ = X U A (41.62) X A U ˙ = X U A ¯ {:(41.62)X^(AU^(˙))= bar(X^(UA)):}\begin{equation*} X^{A \dot{U}}=\overline{X^{U A}} \tag{41.62} \end{equation*}(41.62)XAU˙=XUA
An example of a Hermitian 1,1-spinor is provided by (41.61). The concept of Hermiticity can be stated in other words, and more generally. Associated with any N , N N , N N,NN, NN,N-spinor Φ Φ Phi\PhiΦ with components Φ 1 A 1 A N U ˙ 1 θ ˙ N v Φ 1 A 1 A N U ˙ 1 θ ˙ N v Phi_(1)A_(1)dotsA_(N)U^(˙)_(1)dotstheta^(˙)_(Nv)\Phi_{1} A_{1} \ldots A_{N} \dot{U}_{1} \ldots \dot{\theta}_{N v}Φ1A1ANU˙1θ˙Nv is the conjugate complex spinor ϕ ¯ ϕ ¯ bar(phi)\bar{\phi}ϕ¯ with
(41.63) ( Φ ¯ ) A 1 A N U ˙ 1 U ˙ N = ( Φ U 1 U s i ˙ 1 i ˙ x ) (41.63) ( Φ ¯ ) A 1 A N U ˙ 1 U ˙ N = Φ U 1 U s i ˙ 1 i ˙ x ¯ {:(41.63)( bar(Phi))^(A_(1)dotsA_(N)U^(˙)_(1)dotsU^(˙)_(N))= bar((Phi^(U_(1)dotsU_(s)i^(˙)_(1)dotsi^(˙)_(x)))):}\begin{equation*} (\bar{\Phi})^{A_{1} \ldots A_{N} \dot{U}_{1} \ldots \dot{U}_{N}}=\overline{\left(\Phi^{U_{1} \ldots U_{s} \dot{i}_{1} \ldots \dot{i}_{x}}\right)} \tag{41.63} \end{equation*}(41.63)(Φ¯)A1ANU˙1U˙N=(ΦU1Usi˙1i˙x)
An N , N N , N N,NN, NN,N-spinor is said to be Hermitian when it is equal to its conjugate complex.

§41.7. SPINOR ALGEBRA

Equation (41.53) showed the component ξ 1 ξ 1 xi^('1)\xi^{\prime 1}ξ1 of a spinor rising exponentially with a boost in proportion to the factor e 1 / 2 β z e 1 / 2 β z e^(1//2beta_(z))e^{1 / 2 \beta_{z}}e1/2βz, and the other component, ξ 2 ξ 2 xi^('2)\xi^{\prime 2}ξ2 falling exponentially. If from two spinors ξ ξ xi\xiξ and ζ ζ zeta\zetaζ, there is to be any quantity constructed
Spinor algebra:
N , N N , N N,N\mathrm{N}, \mathrm{N}N,N-spinors and Hermiticity
(1) ε A B , ε A B ε A B , ε A B epsi^(AB),epsi_(AB)\varepsilon^{A B}, \varepsilon_{A B}εAB,εAB defined
(2) raising and lowering spinor indices
(3) scalar products of spinors
(4) the mapping between vectors and 1,1-spinors
(5) σ μ σ μ sigma^(mu)\sigma^{\mu}σμ defined and related to σ μ σ μ sigma_(mu)\sigma_{\mu}σμ
as ξ 1 ζ 2 ξ 1 ζ 2 xi^(1)zeta^(2)\xi^{1} \zeta^{2}ξ1ζ2 and ξ 2 ζ 1 ξ 2 ζ 1 xi^(2)zeta^(1)\xi^{2} \zeta^{1}ξ2ζ1. One can restate this product prescription in other language. Introduce the alternating symbols ε A B ε A B epsi^(AB)\varepsilon^{A B}εAB and ε A B ε A B epsi_(AB)\varepsilon_{A B}εAB such that ε 12 = ε 12 = 1 ε 12 = ε 12 = 1 epsi^(12)=epsi_(12)=1\varepsilon^{12}=\varepsilon_{12}=1ε12=ε12=1 and
(41.64) ε A B = ε B A , ε A B = ε B A (41.64) ε A B = ε B A , ε A B = ε B A {:(41.64)epsi^(AB)=-epsi^(BA)","quadepsi_(AB)=-epsi_(BA):}\begin{equation*} \varepsilon^{A B}=-\varepsilon^{B A}, \quad \varepsilon_{A B}=-\varepsilon_{B A} \tag{41.64} \end{equation*}(41.64)εAB=εBA,εAB=εBA
the only other nonvanishing components being ε 21 = ε 21 = 1 ε 21 = ε 21 = 1 epsi^(21)=epsi_(21)=-1\varepsilon^{21}=\varepsilon_{21}=-1ε21=ε21=1. Define the lowerlabel spinor ξ A ξ A xi_(A)\xi_{A}ξA in terms of the upper-label spinor ξ A ξ A xi^(A)\xi^{A}ξA by the equation
(41.65) ξ A = ξ B ε B A (41.65) ξ A = ξ B ε B A {:(41.65)xi_(A)=xi^(B)epsi_(BA):}\begin{equation*} \xi_{A}=\xi^{B} \varepsilon_{B A} \tag{41.65} \end{equation*}(41.65)ξA=ξBεBA
with the inverse
(41.66) ξ B = ε B C ξ C (41.66) ξ B = ε B C ξ C {:(41.66)xi^(B)=epsi^(BC)xi_(C):}\begin{equation*} \xi^{B}=\varepsilon^{B C} \xi_{C} \tag{41.66} \end{equation*}(41.66)ξB=εBCξC
Then the scalar product of one spinor by another is defined to be
(41.67) ξ A ζ A (41.67) ξ A ζ A {:(41.67)xi_(A)zeta^(A):}\begin{equation*} \xi_{A} \zeta^{A} \tag{41.67} \end{equation*}(41.67)ξAζA
The value of this scalar product is unaffected by any boost or rotation or combination thereof:
ξ A ζ A = ξ B ε B A ζ A = ( L B D ξ D ) ε B A ( L A c ζ C ) = ( det L ) ξ D ε D C ζ C (41.68) = ξ C ζ C ξ A ζ A = ξ B ε B A ζ A = L B D ξ D ε B A L A c ζ C = ( det L ) ξ D ε D C ζ C (41.68) = ξ C ζ C {:[xi_(A)^(')zeta^('A)=xi^('B)epsi_(BA)zeta^('A)],[=(L^(B)_(D)xi^(D))epsi_(BA)(L^(A)_(c)zeta^(C))],[=(det L)xi^(D)epsi_(DC)zeta^(C)],[(41.68)=xi_(C)zeta^(C)]:}\begin{align*} \xi_{A}^{\prime} \zeta^{\prime A} & =\xi^{\prime B} \varepsilon_{B A} \zeta^{\prime A} \\ & =\left(L^{B}{ }_{D} \xi^{D}\right) \varepsilon_{B A}\left(L^{A}{ }_{c} \zeta^{C}\right) \\ & =(\operatorname{det} L) \xi^{D} \varepsilon_{D C} \zeta^{C} \\ & =\xi_{C} \zeta^{C} \tag{41.68} \end{align*}ξAζA=ξBεBAζA=(LBDξD)εBA(LAcζC)=(detL)ξDεDCζC(41.68)=ξCζC
The proof uses the fact that the expression L B D ε B A L C A ( 1 ) L B D ε B A L C A ( 1 ) L^(B)_(D)epsi_(BA)L_(C)^(A)(1)L^{B}{ }_{D} \varepsilon_{B A} L_{C}^{A}(1)LBDεBALCA(1) vanishes when D = C D = C D=CD=CD=C, and (2) reduces to the determinant of L L LLL (unity!) or its negative when D = 1 , C = 2 D = 1 , C = 2 D=1,C=2D=1, C=2D=1,C=2, or D = 2 , C = 1 D = 2 , C = 1 D=2,C=1D=2, C=1D=2,C=1. Note that the scalar product ξ A ζ A ξ A ζ A xi^(A)zeta_(A)\xi^{A} \zeta_{A}ξAζA is the negative of the scalar product ξ A ζ A ξ A ζ A xi_(A)zeta^(A)\xi_{A} \zeta^{A}ξAζA. The value of the scalar product of a spinor with itself is automatically zero ("built-in null character of a spinor").
The components of a vector with upper index have been expressed in terms of the components of a 1,1 -spinor with upper indices
(41.69) X A U ˙ = x μ σ μ A U ˙ (41.69) X A U ˙ = x μ σ μ A U ˙ {:(41.69)X^(AU^(˙))=x^(mu)sigma_(mu)^(AU^(˙)):}\begin{equation*} X^{A \dot{U}}=x^{\mu} \sigma_{\mu}^{A \dot{U}} \tag{41.69} \end{equation*}(41.69)XAU˙=xμσμAU˙
and a similar correlation holds between vector and 1,1-spinor with lower indices; thus,
(41.70) X A U ˙ = x μ σ A U ˙ (41.70) X A U ˙ = x μ σ A U ˙ {:(41.70)X_(AU^(˙))=x_(mu)sigma_(AU^(˙)):}\begin{equation*} X_{A \dot{U}}=x_{\mu} \sigma_{A \dot{U}} \tag{41.70} \end{equation*}(41.70)XAU˙=xμσAU˙
Here the "associated basic spin matrices" have the components
(41.71) σ A U ˙ μ = η μ ν σ ν B V ˙ ε B A ε V ˙ U ˙ (41.71) σ A U ˙ μ = η μ ν σ ν B V ˙ ε B A ε V ˙ U ˙ {:(41.71)sigma_(AU^(˙))^(mu)=eta^(mu nu)sigma_(nu)^(BV^(˙))epsi_(BA)epsi_(V^(˙)U^(˙)):}\begin{equation*} \sigma_{A \dot{U}}^{\mu}=\eta^{\mu \nu} \sigma_{\nu}^{B \dot{V}} \varepsilon_{B A} \varepsilon_{\dot{V} \dot{U}} \tag{41.71} \end{equation*}(41.71)σAU˙μ=ημνσνBV˙εBAεV˙U˙
or, explicitly,
(41.72) σ μ 1 i σ μ 12 σ μ 2 i σ μ 2 2 ˙ = { 1 0 0 1 for μ = 0 0 1 1 0 for μ = 1 + 0 i i 0 for μ = 2 1 0 0 1 for μ = 3 (41.72) σ μ 1 i σ μ 12 σ μ 2 i σ μ 2 2 ˙ = 1 0 0 1  for  μ = 0 0 1 1 0  for  μ = 1 + 0 i i 0  for  μ = 2 1 0 0 1  for  μ = 3 {:(41.72)||[sigma^(mu)_(1i),sigma^(mu)_(12)],[sigma^(mu)_(2i),sigma^(mu)_(22^(˙))]||={[-||[1,0],[0,1]||" for "mu=0],[-||[0,1],[1,0]||" for "mu=1],[+||[0,-i],[i,0]||" for "mu=2],[-||[1,0],[0,-1]||" for "mu=3]:}:}\left\|\begin{array}{ll} \sigma^{\mu}{ }_{1 i} & \sigma^{\mu}{ }_{12} \tag{41.72}\\ \sigma^{\mu}{ }_{2 i} & \sigma^{\mu}{ }_{2 \dot{2}} \end{array}\right\|=\left\{\begin{array}{l} -\left\|\begin{array}{ll} 1 & 0 \\ 0 & 1 \end{array}\right\| \text { for } \mu=0 \\ -\left\|\begin{array}{cc} 0 & 1 \\ 1 & 0 \end{array}\right\| \text { for } \mu=1 \\ +\left\|\begin{array}{cc} 0 & -i \\ i & 0 \end{array}\right\| \text { for } \mu=2 \\ -\left\|\begin{array}{cc} 1 & 0 \\ 0 & -1 \end{array}\right\| \text { for } \mu=3 \end{array}\right.(41.72)σμ1iσμ12σμ2iσμ22˙={1001 for μ=00110 for μ=1+0ii0 for μ=21001 for μ=3
The same type of multiplication law holds for these matrices, ( σ x ) 2 = ( σ y ) 2 = σ x 2 = σ y 2 = (sigma^(x))^(2)=(sigma^(y))^(2)=\left(\sigma^{x}\right)^{2}=\left(\sigma^{y}\right)^{2}=(σx)2=(σy)2= ( σ z ) 2 = 1 , σ x σ y = σ y σ x = i σ z σ z 2 = 1 , σ x σ y = σ y σ x = i σ z (sigma^(z))^(2)=1,sigma^(x)sigma^(y)=-sigma^(y)sigma^(x)=isigma^(z)\left(\sigma^{z}\right)^{2}=1, \sigma^{x} \sigma^{y}=-\sigma^{y} \sigma^{x}=i \sigma^{z}(σz)2=1,σxσy=σyσx=iσz, etc., as for the matrices σ x , σ y , σ z σ x , σ y , σ z sigma_(x),sigma_(y),sigma_(z)\sigma_{x}, \sigma_{y}, \sigma_{z}σx,σy,σz of (41.2). Between the "basic spin matrices," σ μ σ μ sigma_(mu)\sigma_{\mu}σμ, and the "associated basic spin matrices," σ μ σ μ sigma^(mu)\sigma^{\mu}σμ, the following orthogonality and normalization relations obtain:
(41.73) σ μ A U ˙ σ μ B V ˙ = 2 δ B A δ V U ˙ (41.73) σ μ A U ˙ σ μ B V ˙ = 2 δ B A δ V U ˙ {:(41.73)sigma_(mu)^(AU^(˙)_(sigma)^(mu))_(BV^(˙))=-2delta_(B)^(A)delta_(V)^(U^(˙)):}\begin{equation*} \sigma_{\mu}^{A \dot{U}_{\sigma}^{\mu}}{ }_{B \dot{V}}=-2 \delta_{B}^{A} \delta_{V}^{\dot{U}} \tag{41.73} \end{equation*}(41.73)σμAU˙σμBV˙=2δBAδVU˙
and
(41.74) σ μ A U ˙ σ A U ˙ = 2 δ μ ν . (41.74) σ μ A U ˙ σ A U ˙ = 2 δ μ ν . {:(41.74)sigma_(mu)^(AU^(˙)_(sigma))_(AU^(˙))=-2delta_(mu)^(nu).:}\begin{equation*} \sigma_{\mu}{ }^{A \dot{U}_{\sigma}}{ }_{A \dot{U}}=-2 \delta_{\mu}^{\nu} . \tag{41.74} \end{equation*}(41.74)σμAU˙σAU˙=2δμν.
One can use these relations to "go back from a quantity expressed as a 1,1 -spinor ('spinor equivalent of a vector') to the same quantity expressed directly as a vector (first-rank tensor)." Thus, multiply through (41.61) on both sides by 1 2 σ ν A U ˙ 1 2 σ ν A U ˙ -(1)/(2)sigma^(nu)_(AU^(˙))-\frac{1}{2} \sigma^{\nu}{ }_{A \dot{U}}12σνAU˙, sum over the spinor indices, and employ (41.74) to find the contravariant components of the vector,
(41.75) x ν = 1 2 σ A U ˙ A X A U ˙ . (41.75) x ν = 1 2 σ A U ˙ A X A U ˙ . {:(41.75)x^(nu)=-(1)/(2)sigma_(AU^(˙))_(A)X^(AU^(˙)).:}\begin{equation*} x^{\nu}=-\frac{1}{2} \sigma_{A \dot{U}}{ }_{A} X^{A \dot{U}} . \tag{41.75} \end{equation*}(41.75)xν=12σAU˙AXAU˙.
Similarly from (41.70) and (41.73) one finds the covariant components,
(41.76) x v = 1 2 σ ν A U ˙ X A U ˙ . (41.76) x v = 1 2 σ ν A U ˙ X A U ˙ . {:(41.76)x_(v)=-(1)/(2)sigma_(nu)^(AU^(˙))X_(AU^(˙)).:}\begin{equation*} x_{v}=-\frac{1}{2} \sigma_{\nu}{ }^{A \dot{U}} X_{A \dot{U}} . \tag{41.76} \end{equation*}(41.76)xv=12σνAU˙XAU˙.
An N N NNN-index tensor T T TTT lets itself be expressed in spinor language ("spinor equivalent of the tensor") by a generalization of (41.61) or (41.70); thus, for a mixed tensor of third order, one has
(41.77) T A U ˙ B V ˙ W ˙ = σ α A U ˙ β B V ˙ σ ˙ γ c W ˙ T α β γ (41.77) T A U ˙ B V ˙ W ˙ = σ α A U ˙ β B V ˙ σ ˙ γ c W ˙ T α β γ {:(41.77)T_(AU^(˙))^(BV^(˙)W^(˙))=sigma^(alpha)_(AU^(˙))_(beta)^(BV^(˙))sigma^(˙)_(gamma)^(cW^(˙))T_(alpha)^(beta gamma):}\begin{equation*} T_{A \dot{U}}{ }^{B \dot{V} \dot{W}}=\sigma^{\alpha}{ }_{A \dot{U}}{ }_{\beta}{ }^{B \dot{V}} \dot{\sigma}_{\gamma}{ }^{c \dot{W}} T_{\alpha}{ }^{\beta \gamma} \tag{41.77} \end{equation*}(41.77)TAU˙BV˙W˙=σαAU˙βBV˙σ˙γcW˙Tαβγ
and the converse relation
(41.78) T α β γ = ( 1 2 ) 3 σ α α U ˙ σ β B V ˙ σ γ C W ˙ T A U ˙ B V ˙ C W ˙ . (41.78) T α β γ = 1 2 3 σ α α U ˙ σ β B V ˙ σ γ C W ˙ T A U ˙ B V ˙ C W ˙ . {:(41.78)T_(alpha)^(beta gamma)=(-(1)/(2))^(3)sigma_(alpha)_(alpha)U^(˙)_(sigma^(beta))_(BV^(˙))sigma^(gamma)_(CW^(˙))T_(AU^(˙))^(BV^(˙)CW^(˙)).:}\begin{equation*} T_{\alpha}^{\beta \gamma}=\left(-\frac{1}{2}\right)^{3} \sigma_{\alpha}{ }_{\alpha} \dot{U}_{\sigma^{\beta}}{ }_{B \dot{V}} \sigma^{\gamma}{ }_{C \dot{W}} T_{A \dot{U}}{ }^{B \dot{V} C \dot{W}} . \tag{41.78} \end{equation*}(41.78)Tαβγ=(12)3σααU˙σβBV˙σγCW˙TAU˙BV˙CW˙.
Box 41.1 gives the spinor representation of several simple tensors.
Box 41.1 SPINOR REPRESENTATION OF CERTAIN SIMPLE TENSORS IN THE CONTEXT OF A LOCAL LORENTZ FRAME
Quantity Tensor language Spinor language
General 4-vector x α x α x^(alpha)x^{\alpha}xα (four complex numbers) X A U U X A U U X^(AUU)X^{A U U}XAUU (4 complex numbers)
Real 4-vector (example: 4-momentum) x α = x ¯ α x α = x ¯ α x^(alpha)= bar(x)^(alpha)x^{\alpha}=\bar{x}^{\alpha}xα=x¯α (four real numbers)
X A U ˙ = ( X U A ) X A U ˙ = X U A ¯ X^(AU^(˙))= bar((X^(UA)))X^{A \dot{U}}=\overline{\left(X^{U A}\right)}XAU˙=(XUA) (2 real components,
1 distinct complex component)
X^(AU^(˙))= bar((X^(UA))) (2 real components, 1 distinct complex component)| $X^{A \dot{U}}=\overline{\left(X^{U A}\right)}$ (2 real components, | | :--- | | 1 distinct complex component) |
Null 4-vector η α β x α x β = 0 η α β x α x β = 0 eta_(alpha beta)x^(alpha)x^(beta)=0\eta_{\alpha \beta} x^{\alpha} x^{\beta}=0ηαβxαxβ=0 det X A U ˙ = 0 [ det X A U ˙ = 0 [ detX^(AU^(˙))=0[\operatorname{det} X^{A \dot{U}}=0[detXAU˙=0[ see (41.57)]; hence there exist two spinors ξ A ξ A xi^(A)\xi^{A}ξA and η U η U eta^(U)\eta^{U}ηU such that X A U = ξ A η U ˙ X A U = ξ A η U ˙ X^(AU)=xi^(A)eta^(U^(˙))X^{A U}=\xi^{A} \eta^{\dot{U}}XAU=ξAηU˙.
Future-pointing real null 4 -vector (such as 4-momentum of a photon) x α = x ¯ α η α β x α x β = 0 x 0 > 0 x α = x ¯ α η α β x α x β = 0 x 0 > 0 {:[x^(alpha)= bar(x)^(alpha)],[eta_(alpha beta)x^(alpha)x^(beta)=0],[x^(0) > 0]:}\begin{aligned} & x^{\alpha}=\bar{x}^{\alpha} \\ & \eta_{\alpha \beta} x^{\alpha} x^{\beta}=0 \\ & x^{0}>0 \end{aligned}xα=x¯αηαβxαxβ=0x0>0 There exists a spinor ξ A ξ A xi^(A)\xi^{A}ξA (two complex numbers, unique up to a common multiplicative phase factor e i δ ) e i δ {:e^(i delta))\left.e^{i \delta}\right)eiδ) such that X A U = ξ A ( ξ ¯ ) U ˙ X A U = ξ A ( ξ ¯ ) U ˙ X^(AU)=xi^(A)( bar(xi))^(U^(˙))X^{A \mathscr{U}}=\xi^{A}(\bar{\xi})^{\dot{U}}XAU=ξA(ξ¯)U˙
Past-pointing real null 4 -vector x 0 < 0 x 0 < 0 x^(0) < 0x^{0}<0x0<0 X A U ˙ d = ξ A ( ξ ¯ ) U ˙ X A U ˙ = ξ A ( ξ ¯ ) U ˙ X^(A)U^(˙)^("d ")=-xi^(A)( bar(xi))^(U^(˙))X^{A} \dot{U}{ }^{\text {d }}=-\xi^{A}(\bar{\xi})^{\dot{U}}XAU˙=ξA(ξ¯)U˙.
Real bivector or 2 -form (such as Maxwell field) F [ α β ] (subscript implying F α β = F β α ; six distinct real components) F [ α β ]  (subscript implying  F α β = F β α ; six distinct real   components)  {:[F_([alpha beta])" (subscript implying "F_(alpha beta)=],[-F_(beta alpha)"; six distinct real "],[" components) "]:}\begin{aligned} & F_{[\alpha \beta]} \text { (subscript implying } F_{\alpha \beta}= \\ & -F_{\beta \alpha} \text {; six distinct real } \\ & \text { components) } \end{aligned}F[αβ] (subscript implying Fαβ=Fβα; six distinct real  components)  There exists a symmetric spinor ϕ A B ϕ A B phi_(AB)\phi_{A B}ϕAB (three distinct complex components ϕ 11 , ϕ 12 , ϕ 22 ϕ 11 , ϕ 12 , ϕ 22 phi_(11),phi_(12),phi_(22)\phi_{11}, \phi_{12}, \phi_{22}ϕ11,ϕ12,ϕ22 ) such that F A U ˙ B ˙ V ˙ = ϕ A B ε U ˙ V ˙ + ε A B ( ϕ ¯ ) U ˙ V ˙ F A U ˙ B ˙ V ˙ = ϕ A B ε U ˙ V ˙ + ε A B ( ϕ ¯ ) U ˙ V ˙ F_(AU^(˙)B^(˙)V^(˙))=phi_(AB^(epsi)U^(˙)V^(˙))+epsi_(AB)( bar(phi))_(U^(˙)V^(˙))F_{A \dot{U} \dot{B} \dot{V}}=\phi_{A B^{\varepsilon} \dot{U} \dot{V}}+\varepsilon_{A B}(\bar{\phi})_{\dot{U} \dot{V}}FAU˙B˙V˙=ϕABεU˙V˙+εAB(ϕ¯)U˙V˙
Real 2 -form dual to foregoing real 2-form F α β = 1 2 ε α β γ δ F γ δ F α β = 1 2 ε α β γ δ F γ δ ^(**)F_(alpha beta)=(1)/(2)epsi_(alpha beta gamma delta)F^(gamma delta){ }^{*} F_{\alpha \beta}=\frac{1}{2} \varepsilon_{\alpha \beta \gamma \delta} F^{\gamma \delta}Fαβ=12εαβγδFγδ
F A U B V = i ϕ A B V ˙ V + i i A B ( ϕ ¯ ) U ˙ V ˙ F A U B V = i ϕ A B V ˙ V + i i A B ( ϕ ¯ ) U ˙ V ˙ ^(**)F_(AUBV)=-iphi_(AB)^(ℓ)V^(˙)V+ii_(AB)( bar(phi))_(U^(˙)V^(˙)){ }^{*} F_{A U B V}=-i \phi_{A B}{ }^{\ell} \dot{V} V+i i_{A B}(\bar{\phi})_{\dot{U} \dot{V}}FAUBV=iϕABV˙V+iiAB(ϕ¯)U˙V˙
(duality for 2 -form corresponds to multiplication of spinor ϕ A B ϕ A B phi_(AB)\phi_{A B}ϕAB by i i -i-ii )
^(**)F_(AUBV)=-iphi_(AB)^(ℓ)V^(˙)V+ii_(AB)( bar(phi))_(U^(˙)V^(˙)) (duality for 2 -form corresponds to multiplication of spinor phi_(AB) by -i )| ${ }^{*} F_{A U B V}=-i \phi_{A B}{ }^{\ell} \dot{V} V+i i_{A B}(\bar{\phi})_{\dot{U} \dot{V}}$ | | :--- | | (duality for 2 -form corresponds to multiplication of spinor $\phi_{A B}$ by $-i$ ) |
Real fourth-order tensor with symmetries of Weyl conformal curvature tensor; that is, with symmetries of Riemann curvature tensor and with additional requirement of vanishing Ricci tensor ("empty space;" "vacuum Riemann tensor") C α β γ δ = C ( [ α β [ γ δ ] ) C α β γ δ = C ( [ α β [ γ δ ] ) C_(alpha beta gamma delta)=C_(([alpha beta||[gamma delta]))C_{\alpha \beta \gamma \delta}=C_{([\alpha \beta \|[\gamma \delta])}Cαβγδ=C([αβ[γδ]) (antisymmetric in first two indices; antisymmetric in last two indices; symmetric against interchange of first pair with second pair) C α [ β γ 8 ] = 0 C α [ β γ 8 ] = 0 C^(alpha)_([beta gamma8])=0C^{\alpha}{ }_{[\beta \gamma 8]}=0Cα[βγ8]=0 (20 algebraically distinct components, as for the Riemann tensor, reduced to 10 by the further vacuum condition:) C α β a δ = 0 C α β a δ = 0 C^(alpha)_(beta a delta)=0C^{\alpha}{ }_{\beta a \delta}=0Cαβaδ=0
There exists a completely symmetric spinor ψ A B C D ψ A B C D psi_(ABCD)\psi_{A B C D}ψABCD with five distinct complex components,
ψ 1111 ψ 1111 psi_(1111)\psi_{1111}ψ1111
ψ 1112 ψ 1112 psi_(1112)\psi_{1112}ψ1112
ψ 1122 ψ 1122 psi_(1122)\psi_{1122}ψ1122
ψ 1222 ψ 1222 psi_(1222)\psi_{1222}ψ1222
ψ 2222 ψ 2222 psi_(2222)\psi_{2222}ψ2222
such that C A A ˙ B B ˙ C W ˙ D X = C A A ˙ B B ˙ C W ˙ D X = C_(AA^(˙)BB^(˙)CW^(˙)DX)=C_{A \dot{A} B \dot{B} C \dot{W} D X}=CAA˙BB˙CW˙DX=
ψ A B C D ε U ˙ V ~ ˙ W ˙ X ˙ + ε A B ε ε C D ψ ¯ U ˙ V ˙ W ˙ X ˙ ψ A B C D ε U ˙ V ~ ˙ W ˙ X ˙ + ε A B ε ε C D ψ ¯ U ˙ V ˙ W ˙ X ˙ psi_(ABCD)epsi_(U^(˙) tilde(V)^(˙)W^(˙)X^(˙))+epsi_(AB^(epsi))epsi_(CD) bar(psi)_(U^(˙)V^(˙)W^(˙)X^(˙))\psi_{A B C D} \varepsilon_{\dot{U} \dot{\tilde{V}} \dot{W} \dot{X}}+\varepsilon_{A B^{\varepsilon}} \varepsilon_{C D} \bar{\psi}_{\dot{U} \dot{V} \dot{W} \dot{X}}ψABCDεU˙V~˙W˙X˙+εABεεCDψ¯U˙V˙W˙X˙
There exists a completely symmetric spinor psi_(ABCD) with five distinct complex components, psi_(1111) psi_(1112) psi_(1122) psi_(1222) psi_(2222) such that C_(AA^(˙)BB^(˙)CW^(˙)DX)= psi_(ABCD)epsi_(U^(˙) tilde(V)^(˙)W^(˙)X^(˙))+epsi_(AB^(epsi))epsi_(CD) bar(psi)_(U^(˙)V^(˙)W^(˙)X^(˙))| There exists a completely symmetric spinor $\psi_{A B C D}$ with five distinct complex components, | | :--- | | $\psi_{1111}$ | | $\psi_{1112}$ | | $\psi_{1122}$ | | $\psi_{1222}$ | | $\psi_{2222}$ | | such that $C_{A \dot{A} B \dot{B} C \dot{W} D X}=$ | | $\psi_{A B C D} \varepsilon_{\dot{U} \dot{\tilde{V}} \dot{W} \dot{X}}+\varepsilon_{A B^{\varepsilon}} \varepsilon_{C D} \bar{\psi}_{\dot{U} \dot{V} \dot{W} \dot{X}}$ |
Quantity Tensor language Spinor language General 4-vector x^(alpha) (four complex numbers) X^(AUU) (4 complex numbers) Real 4-vector (example: 4-momentum) x^(alpha)= bar(x)^(alpha) (four real numbers) "X^(AU^(˙))= bar((X^(UA))) (2 real components, 1 distinct complex component)" Null 4-vector eta_(alpha beta)x^(alpha)x^(beta)=0 detX^(AU^(˙))=0[ see (41.57)]; hence there exist two spinors xi^(A) and eta^(U) such that X^(AU)=xi^(A)eta^(U^(˙)). Future-pointing real null 4 -vector (such as 4-momentum of a photon) "x^(alpha)= bar(x)^(alpha) eta_(alpha beta)x^(alpha)x^(beta)=0 x^(0) > 0" There exists a spinor xi^(A) (two complex numbers, unique up to a common multiplicative phase factor {:e^(i delta)) such that X^(AU)=xi^(A)( bar(xi))^(U^(˙)) Past-pointing real null 4 -vector x^(0) < 0 X^(A)U^(˙)^("d ")=-xi^(A)( bar(xi))^(U^(˙)). Real bivector or 2 -form (such as Maxwell field) "F_([alpha beta]) (subscript implying F_(alpha beta)= -F_(beta alpha); six distinct real components) " There exists a symmetric spinor phi_(AB) (three distinct complex components phi_(11),phi_(12),phi_(22) ) such that F_(AU^(˙)B^(˙)V^(˙))=phi_(AB^(epsi)U^(˙)V^(˙))+epsi_(AB)( bar(phi))_(U^(˙)V^(˙)) Real 2 -form dual to foregoing real 2-form ^(**)F_(alpha beta)=(1)/(2)epsi_(alpha beta gamma delta)F^(gamma delta) "^(**)F_(AUBV)=-iphi_(AB)^(ℓ)V^(˙)V+ii_(AB)( bar(phi))_(U^(˙)V^(˙)) (duality for 2 -form corresponds to multiplication of spinor phi_(AB) by -i )" Real fourth-order tensor with symmetries of Weyl conformal curvature tensor; that is, with symmetries of Riemann curvature tensor and with additional requirement of vanishing Ricci tensor ("empty space;" "vacuum Riemann tensor") C_(alpha beta gamma delta)=C_(([alpha beta||[gamma delta])) (antisymmetric in first two indices; antisymmetric in last two indices; symmetric against interchange of first pair with second pair) C^(alpha)_([beta gamma8])=0 (20 algebraically distinct components, as for the Riemann tensor, reduced to 10 by the further vacuum condition:) C^(alpha)_(beta a delta)=0 "There exists a completely symmetric spinor psi_(ABCD) with five distinct complex components, psi_(1111) psi_(1112) psi_(1122) psi_(1222) psi_(2222) such that C_(AA^(˙)BB^(˙)CW^(˙)DX)= psi_(ABCD)epsi_(U^(˙) tilde(V)^(˙)W^(˙)X^(˙))+epsi_(AB^(epsi))epsi_(CD) bar(psi)_(U^(˙)V^(˙)W^(˙)X^(˙))"| Quantity | Tensor language | Spinor language | | :---: | :---: | :---: | | General 4-vector | $x^{\alpha}$ (four complex numbers) | $X^{A U U}$ (4 complex numbers) | | Real 4-vector (example: 4-momentum) | $x^{\alpha}=\bar{x}^{\alpha}$ (four real numbers) | $X^{A \dot{U}}=\overline{\left(X^{U A}\right)}$ (2 real components, <br> 1 distinct complex component) | | Null 4-vector | $\eta_{\alpha \beta} x^{\alpha} x^{\beta}=0$ | $\operatorname{det} X^{A \dot{U}}=0[$ see (41.57)]; hence there exist two spinors $\xi^{A}$ and $\eta^{U}$ such that $X^{A U}=\xi^{A} \eta^{\dot{U}}$. | | Future-pointing real null 4 -vector (such as 4-momentum of a photon) | $\begin{aligned} & x^{\alpha}=\bar{x}^{\alpha} \\ & \eta_{\alpha \beta} x^{\alpha} x^{\beta}=0 \\ & x^{0}>0 \end{aligned}$ | There exists a spinor $\xi^{A}$ (two complex numbers, unique up to a common multiplicative phase factor $\left.e^{i \delta}\right)$ such that $X^{A \mathscr{U}}=\xi^{A}(\bar{\xi})^{\dot{U}}$ | | Past-pointing real null 4 -vector | $x^{0}<0$ | $X^{A} \dot{U}{ }^{\text {d }}=-\xi^{A}(\bar{\xi})^{\dot{U}}$. | | Real bivector or 2 -form (such as Maxwell field) | $\begin{aligned} & F_{[\alpha \beta]} \text { (subscript implying } F_{\alpha \beta}= \\ & -F_{\beta \alpha} \text {; six distinct real } \\ & \text { components) } \end{aligned}$ | There exists a symmetric spinor $\phi_{A B}$ (three distinct complex components $\phi_{11}, \phi_{12}, \phi_{22}$ ) such that $F_{A \dot{U} \dot{B} \dot{V}}=\phi_{A B^{\varepsilon} \dot{U} \dot{V}}+\varepsilon_{A B}(\bar{\phi})_{\dot{U} \dot{V}}$ | | Real 2 -form dual to foregoing real 2-form | ${ }^{*} F_{\alpha \beta}=\frac{1}{2} \varepsilon_{\alpha \beta \gamma \delta} F^{\gamma \delta}$ | ${ }^{*} F_{A U B V}=-i \phi_{A B}{ }^{\ell} \dot{V} V+i i_{A B}(\bar{\phi})_{\dot{U} \dot{V}}$ <br> (duality for 2 -form corresponds to multiplication of spinor $\phi_{A B}$ by $-i$ ) | | Real fourth-order tensor with symmetries of Weyl conformal curvature tensor; that is, with symmetries of Riemann curvature tensor and with additional requirement of vanishing Ricci tensor ("empty space;" "vacuum Riemann tensor") | $C_{\alpha \beta \gamma \delta}=C_{([\alpha \beta \\|[\gamma \delta])}$ (antisymmetric in first two indices; antisymmetric in last two indices; symmetric against interchange of first pair with second pair) $C^{\alpha}{ }_{[\beta \gamma 8]}=0$ (20 algebraically distinct components, as for the Riemann tensor, reduced to 10 by the further vacuum condition:) $C^{\alpha}{ }_{\beta a \delta}=0$ | There exists a completely symmetric spinor $\psi_{A B C D}$ with five distinct complex components, <br> $\psi_{1111}$ <br> $\psi_{1112}$ <br> $\psi_{1122}$ <br> $\psi_{1222}$ <br> $\psi_{2222}$ <br> such that $C_{A \dot{A} B \dot{B} C \dot{W} D X}=$ <br> $\psi_{A B C D} \varepsilon_{\dot{U} \dot{\tilde{V}} \dot{W} \dot{X}}+\varepsilon_{A B^{\varepsilon}} \varepsilon_{C D} \bar{\psi}_{\dot{U} \dot{V} \dot{W} \dot{X}}$ |
In some treatises on spinor analysis, the factor ( 1 2 ) N 1 2 N (-(1)/(2))^(N)\left(-\frac{1}{2}\right)^{N}(12)N in equations like (41.78) is eliminated by the following double prescription: (1) insert into the matrices σ μ σ μ sigma_(mu)\sigma_{\mu}σμ and σ μ σ μ sigma^(mu)\sigma^{\mu}σμ a factor 1 / 2 1 / 2 1//sqrt21 / \sqrt{2}1/2 not included above; and (2) use for the standard metric not diag η μ ν = ( 1 , 1 , 1 , 1 ) η μ ν = ( 1 , 1 , 1 , 1 ) eta_(mu nu)=(-1,1,1,1)\eta_{\mu \nu}=(-1,1,1,1)ημν=(1,1,1,1) as above, but ( 1 , 1 , 1 , 1 ) ( 1 , 1 , 1 , 1 ) (1,-1,-1,-1)(1,-1,-1,-1)(1,1,1,1). This prescription was not adopted here (1) because the introduction of 1 / 2 1 / 2 1//sqrt21 / \sqrt{2}1/2 in the matrices σ x , σ y , σ z σ x , σ y , σ z sigma_(x),sigma_(y),sigma_(z)\sigma_{x}, \sigma_{y}, \sigma_{z}σx,σy,σz would put them out of line with the Pauli matrices as used for many years throughout
Quantity Tensor language Spinor language
Quantity Tensor language Spinor language| Quantity | Tensor language | Spinor language | | :--- | ---: | :--- |

§41.8. SPIN SPACE AND ITS BASIS SPINORS

The "space" of elementary spinors is two-dimensional. Therefore it is spanned by any two linearly independent spinors λ A λ A lambda_(A)\lambda_{A}λA and μ A μ A mu_(A)\mu_{A}μA. Moreover, it is easy to diagnose a pair of spinors for possible linear dependence, that is, for existence of a relation of the form μ A = μ A = mu_(A)=\mu_{A}=μA= const λ A λ A lambda_(A)\lambda_{A}λA. In this event, the scalar product of μ A μ A mu_(A)\mu_{A}μA with λ A λ A lambda^(A)\lambda^{A}λA, like the scalar product of λ A λ A lambda_(A)\lambda_{A}λA with λ A λ A lambda^(A)\lambda^{A}λA (41.67) automatically vanishes. Therefore a nonvanishing scalar product
(41.79) λ A μ A 0 (41.79) λ A μ A 0 {:(41.79)lambda_(A)mu^(A)!=0:}\begin{equation*} \lambda_{A} \mu^{A} \neq 0 \tag{41.79} \end{equation*}(41.79)λAμA0
is a necessary and sufficient condition for the linear independence of two spinors.
The general 4-vector lets itself be represented as a linear combination of four basis vectors. Similarly the general spinor lets itself be represented as a linear combination of two basis spinors:
(41.80) ω A = λ ξ A + μ η A . (41.80) ω A = λ ξ A + μ η A . {:(41.80)omega^(A)=lambdaxi^(A)+mueta^(A).:}\begin{equation*} \omega^{A}=\lambda \xi^{A}+\mu \eta^{A} . \tag{41.80} \end{equation*}(41.80)ωA=λξA+μηA.
Here it is understood that the term "basis spinor" implies that ξ A ξ A xi^(A)\xi^{A}ξA and η A η A eta^(A)\eta^{A}ηA satisfy the normalization condition
(41.81) ξ A η A = 1 ( = η A ξ 4 ) (41.81) ξ A η A = 1 = η A ξ 4 {:(41.81)xi_(A)eta^(A)=1(=-eta_(A)xi^(4)):}\begin{equation*} \xi_{A} \eta^{A}=1\left(=-\eta_{A} \xi^{4}\right) \tag{41.81} \end{equation*}(41.81)ξAηA=1(=ηAξ4)
From this condition one derives simple expressions for the expansion coefficients in (41.80):
λ = η A ω A ( = ω B η B ) , (41.82) μ = ξ A ω A ( = ω B ξ B ) . λ = η A ω A = ω B η B , (41.82) μ = ξ A ω A = ω B ξ B . {:[lambda=-eta_(A)omega^(A)(=omega_(B)eta^(B))","],[(41.82)mu=xi_(A)omega^(A)(=-omega_(B)xi^(B)).]:}\begin{align*} & \lambda=-\eta_{A} \omega^{A}\left(=\omega_{B} \eta^{B}\right), \\ & \mu=\xi_{A} \omega^{A}\left(=-\omega_{B} \xi^{B}\right) . \tag{41.82} \end{align*}λ=ηAωA(=ωBηB),(41.82)μ=ξAωA(=ωBξB).
Inserting these expansion coefficients back into (41.80) will reproduce any arbitrarily chosen spinor ω A ω A omega^(A)\omega^{A}ωA. In other words, the following equation has to be an identity in the components of ω B ω B omega_(B)\omega_{B}ωB :
(41.83) ω A = ε A B ω B ( ξ A η B η A ξ B ) ω B (41.83) ω A = ε A B ω B ξ A η B η A ξ B ω B {:(41.83)omega^(A)=epsi^(AB)omega_(B)-=(xi^(A)eta^(B)-eta^(A)xi^(B))omega_(B):}\begin{equation*} \omega^{A}=\varepsilon^{A B} \omega_{B} \equiv\left(\xi^{A} \eta^{B}-\eta^{A} \xi^{B}\right) \omega_{B} \tag{41.83} \end{equation*}(41.83)ωA=εABωB(ξAηBηAξB)ωB
From this circumstance, it follows that the components of the two basic spinors are linked by the equations
(41.84) ξ A η B η A ξ B = ε A B . (41.84) ξ A η B η A ξ B = ε A B . {:(41.84)xi^(A)eta^(B)-eta^(A)xi^(B)=epsi^(AB).:}\begin{equation*} \xi^{A} \eta^{B}-\eta^{A} \xi^{B}=\varepsilon^{A B} . \tag{41.84} \end{equation*}(41.84)ξAηBηAξB=εAB.
Given two basis spinors ξ A ξ A xi^(A)\xi^{A}ξA and η A η A eta^(A)\eta^{A}ηA, one can get two equally good basis spinors by writing
ξ new A = ξ A , (41.85) η A new = η A + α ξ A , ξ new  A = ξ A , (41.85) η A new  = η A + α ξ A , {:[xi_("new ")^(A)=xi^(A)","],[(41.85)eta^(A)_("new ")=eta^(A)+alphaxi^(A)","]:}\begin{align*} & \xi_{\text {new }}^{A}=\xi^{A}, \\ & \eta^{A}{ }_{\text {new }}=\eta^{A}+\alpha \xi^{A}, \tag{41.85} \end{align*}ξnew A=ξA,(41.85)ηAnew =ηA+αξA,
with α α alpha\alphaα any real or complex constant, as one checks at once by substitution into (41.81) or (41.84). The most general "spinor mate" to a given spinor ξ A ξ A xi^(A)\xi^{A}ξA, satisfying the normalization condition (41.81), has this form (41.85).
Figure 41.7.
Spinor represented by (1) "flagpole" [Penrose terminology; track of pulse of light; null vector O P O P OP\mathcal{O P}OP ] plus (2) "flag" [arrow ( P ( P (Plongrightarrow(\mathscr{P} \longrightarrow(P ) flashed onto surface of moon by laser pulse from earth or, in expanded view in the inset above, a flag itself, substituted for the arrow] plus (3) the orientation-entanglement relation between the flag and its surroundings [symbolized by strings drawn from corners of flag to surroundings]. When the spinor itself is multiplied by a factor ρ e i σ ρ e i σ rhoe^(i sigma)\rho e^{i \sigma}ρeiσ, the components of the null vector (flagpole) are multiplied by the factor ρ 2 ρ 2 rho^(2)\rho^{2}ρ2 and the flag is rotated through the angle 2 σ 2 σ 2sigma2 \sigma2σ about the flagpole.

§41.9. SPINOR VIEWED AS FLAGPOLE PLUS FLAG PLUS ORIENTATION-ENTANGLEMENT RELATION

How can one visualize a spinor? Aim the laser, pull the trigger, and send a megajoule pulse from the here and now (event O O O\mathcal{O}O ) to the there and then (event P P P\mathscr{P}P : center of the crater Aristarchus, 400 , 000 km 400 , 000 km 400,000km400,000 \mathrm{~km}400,000 km from θ θ theta\mathcal{\theta}θ in space, and 400 , 000 km 400 , 000 km 400,000km400,000 \mathrm{~km}400,000 km from θ θ theta\mathcal{\theta}θ in light-travel time). The laser has been designed to produce, not a mere spot of light, but an illuminated arrow. Following Roger Penrose, speak of the null vector O P O P OP\mathcal{O} \mathscr{P}OP as a "flagpole," and of the illuminated arrow as a "flag." A spinor (Figure 41.7) consists of this combination of (1) null flagpole plus (2) flag plus (3) the orientation-
Geometric representation of a spinor:
(1) null vector (flagpole), plus
(2) bivector (flag) and its orientation-entanglement relation
entanglement relation between the flag and its surroundings. "Rotate the flag" by repeatedly firing the laser, with a bit of rotation of the laser about its axis between one firing and the next. When the flag has turned through 360 360 360^(@)360^{\circ}360 and has come back to its original direction, the spinor has reversed sign. A rotation of the flag about the flagpole through any even multiple of 2 π 2 π 2pi2 \pi2π restores the spinor to its original value.
One goes from a spinor ξ ξ xi\xiξ, a mathematical object with two complex components ξ 1 ξ 1 xi^(1)\xi^{1}ξ1 and ξ 2 ξ 2 xi^(2)\xi^{2}ξ2, to the geometric object of "flagpole plus flag plus orientation-entanglement relation" in two steps: first the pole, then the flag. Thus, go from the spinor ξ A ξ A xi^(A)\xi^{A}ξA to the real null 4 -vector of the "pole" by way of the formula
(41.86) x α X A U ˙ = ξ A ( ξ ¯ ) U ˙ (41.86) x α X A U ˙ = ξ A ( ξ ¯ ) U ˙ {:(41.86)x^(alpha)longrightarrowX^(AU^(˙))=xi^(A)( bar(xi))^(U^(˙)):}\begin{equation*} x^{\alpha} \longrightarrow X^{A \dot{U}}=\xi^{A}(\bar{\xi})^{\dot{U}} \tag{41.86} \end{equation*}(41.86)xαXAU˙=ξA(ξ¯)U˙
or
(41.87) ( t + z ) ( x i y ) ( x + i y ) ( t z ) = ξ 1 ξ ¯ i ξ 1 ξ ¯ 2 ξ 2 ξ ¯ i ξ 2 ξ ¯ 2 (41.87) ( t + z ) ( x i y ) ( x + i y ) ( t z ) = ξ 1 ξ ¯ i ξ 1 ξ ¯ 2 ξ 2 ξ ¯ i ξ 2 ξ ¯ 2 {:(41.87)||[(t+z),(x-iy)],[(x+iy),(t-z)]||=||[xi^(1) bar(xi)^(i),xi^(1) bar(xi)^(2)],[xi^(2) bar(xi)^(i),xi^(2) bar(xi)^(2)]||:}\left\|\begin{array}{ll} (t+z) & (x-i y) \tag{41.87}\\ (x+i y) & (t-z) \end{array}\right\|=\left\|\begin{array}{cc} \xi^{1} \bar{\xi}^{i} & \xi^{1} \bar{\xi}^{2} \\ \xi^{2} \bar{\xi}^{i} & \xi^{2} \bar{\xi}^{2} \end{array}\right\|(41.87)(t+z)(xiy)(x+iy)(tz)=ξ1ξ¯iξ1ξ¯2ξ2ξ¯iξ2ξ¯2
The matrix on the right has its first row identical up to a factor ξ 1 / ξ 2 ξ 1 / ξ 2 xi^(1)//xi^(2)\xi^{1} / \xi^{2}ξ1/ξ2 with its second row. Therefore the determinant of the matrix on the right vanishes. So also for the left. Therefore the 4 -vector O P = ( t , x , y , z ) O P = ( t , x , y , z ) OP=(t,x,y,z)\mathcal{O P}=(t, x, y, z)OP=(t,x,y,z) is a null vector. One "stretches" this vector by a factor ρ 2 ρ 2 rho^(2)\rho^{2}ρ2 when one multiplies the spinor ξ A ξ A xi^(A)\xi^{A}ξA by the nonzero complex number λ = ρ e i σ λ = ρ e i σ lambda=rhoe^(i sigma)\lambda=\rho e^{i \sigma}λ=ρeiσ ( ρ , σ ρ , σ rho,sigma\rho, \sigmaρ,σ real); however, the vector is unchanged in direction. The 4 -vector is also unaffected by the choice of the angle σ σ sigma\sigmaσ. In other words, this null 4 -vector is uniquely fixed by the spinor; but the spinor is not fixed with all uniqueness by the 4 -vector. To a given 4 -vector corresponds a whole family of spinors. They differ from one another by a multiplicative phase factor of the form e i σ e i σ e^(i sigma)e^{i \sigma}eiσ ("flag factor").
Looking further to see the influence of the flag factor showing up, turn from a real vector (four components) generated out of the spinor ξ A ξ A xi^(A)\xi^{A}ξA to a real bivector (six components) generated out of the same spinor:
(41.88) F μ ν F A B U ˙ V ˙ = ξ A ξ B ϵ ˙ U ˙ V ˙ + ϵ A B ( ξ ¯ ) U ˙ ( ξ ¯ ) V ˙ , ; U ˙ ; ν B . (41.88) F μ ν F A B U ˙ V ˙ = ξ A ξ B ϵ ˙ U ˙ V ˙ + ϵ A B ( ξ ¯ ) U ˙ ( ξ ¯ ) V ˙ , ; U ˙ ; ν B . {:(41.88)F^(mu nu)longrightarrowF^(ABU^(˙)V^(˙))=xi^(A)xi^(B)epsilon^(˙)U^(˙)V^(˙):}+epsilon^(AB)( bar(xi))^(U^(˙))( bar(xi))^(V^(˙)),子;U^(˙);nu longrightarrow B.\begin{align*} F^{\mu \nu} \longrightarrow & F^{A B \dot{U} \dot{V}}=\xi^{A} \xi^{B} \dot{\epsilon} \dot{U} \dot{V} \tag{41.88} \end{align*}+\epsilon^{A B}(\bar{\xi})^{\dot{U}}(\bar{\xi})^{\dot{V}}, ~ 子 ; \dot{U} ; \nu \longrightarrow B .(41.88)FμνFABU˙V˙=ξAξBϵ˙U˙V˙+ϵAB(ξ¯)U˙(ξ¯)V˙, ;U˙;νB.
That this quantity has no more than six distinct components ( F μ ν = F ν μ F μ ν = F ν μ F^(mu nu)=-F^(nu mu)F^{\mu \nu}=-F^{\nu \mu}Fμν=Fνμ ) follows from interchanging A A AAA with B B BBB and U ˙ U ˙ U^(˙)\dot{U}U˙ with V ˙ V ˙ V^(˙)\dot{V}V˙, and noting the resultant change in sign on the righthand side of (41.88). To unfold the meaning of this bivector, look in (41.88) for every appearance of the alternating factor ε A B ε A B epsi^(AB)\varepsilon^{A B}εAB. Wherever such a factor appears, insert the expression (41.84) for this factor in terms of the starting spinor ξ A ξ A xi^(A)\xi^{A}ξA and insert the additional spinor η A η A eta^(A)\eta^{A}ηA that is needed, along with ξ A ξ A xi^(A)\xi^{A}ξA, to supply a basis for all spinors. In this way, find
F μ ν F A B U ˙ V ˙ = ξ A ξ B ( ξ ¯ U ˙ η ¯ V ˙ η ¯ U ˙ ξ ¯ V ˙ ) + ( ξ A η B η A ξ B ) ξ ¯ U ˙ ξ ¯ V ˙ (41.89) = ξ A ξ ¯ U ˙ ( ξ B η ¯ V ˙ + η B ξ ¯ V ˙ ) ( ξ A η ¯ U ˙ + η A ξ ¯ U ˙ ) ξ B ξ ¯ V ˙ = X A U ˙ Y B V ˙ Y A U ˙ X B V ˙ x μ y ν y μ x ν F μ ν F A B U ˙ V ˙ = ξ A ξ B ξ ¯ U ˙ η ¯ V ˙ η ¯ U ˙ ξ ¯ V ˙ + ξ A η B η A ξ B ξ ¯ U ˙ ξ ¯ V ˙ (41.89) = ξ A ξ ¯ U ˙ ξ B η ¯ V ˙ + η B ξ ¯ V ˙ ξ A η ¯ U ˙ + η A ξ ¯ U ˙ ξ B ξ ¯ V ˙ = X A U ˙ Y B V ˙ Y A U ˙ X B V ˙ x μ y ν y μ x ν {:[F^(mu nu)longrightarrowF^(ABU^(˙)V^(˙))=xi^(A)xi^(B)( bar(xi)^(U^(˙)) bar(eta)^(V^(˙))- bar(eta)^(U^(˙)) bar(xi)^(V^(˙)))+(xi^(A)eta^(B)-eta^(A)xi^(B)) bar(xi)^(U^(˙)) bar(xi)^(V^(˙))],[(41.89)=xi^(A) bar(xi)^(U^(˙))(xi^(B) bar(eta)^(V^(˙))+eta^(B) bar(xi)^(V^(˙)))-(xi^(A) bar(eta)^(U^(˙))+eta^(A) bar(xi)^(U^(˙)))xi^(B) bar(xi)^(V^(˙))],[=X^(AU^(˙))Y^(BV^(˙))-Y^(AU^(˙))X^(BV^(˙))longrightarrowx^(mu)y^(nu)-y^(mu)x^(nu)]:}\begin{align*} F^{\mu \nu} \longrightarrow F^{A B \dot{U} \dot{V}} & =\xi^{A} \xi^{B}\left(\bar{\xi}^{\dot{U}} \bar{\eta}^{\dot{V}}-\bar{\eta}^{\dot{U}} \bar{\xi}^{\dot{V}}\right)+\left(\xi^{A} \eta^{B}-\eta^{A} \xi^{B}\right) \bar{\xi}^{\dot{U}} \bar{\xi}^{\dot{V}} \\ & =\xi^{A} \bar{\xi}^{\dot{U}}\left(\xi^{B} \bar{\eta}^{\dot{V}}+\eta^{B} \bar{\xi}^{\dot{V}}\right)-\left(\xi^{A} \bar{\eta}^{\dot{U}}+\eta^{A} \bar{\xi}^{\dot{U}}\right) \xi^{B} \bar{\xi}^{\dot{V}} \tag{41.89}\\ & =X^{A \dot{U}} Y^{B \dot{V}}-Y^{A \dot{U}} X^{B \dot{V}} \longrightarrow x^{\mu} y^{\nu}-y^{\mu} x^{\nu} \end{align*}FμνFABU˙V˙=ξAξB(ξ¯U˙η¯V˙η¯U˙ξ¯V˙)+(ξAηBηAξB)ξ¯U˙ξ¯V˙(41.89)=ξAξ¯U˙(ξBη¯V˙+ηBξ¯V˙)(ξAη¯U˙+ηAξ¯U˙)ξBξ¯V˙=XAU˙YBV˙YAU˙XBV˙xμyνyμxν
Thus the 2,2 -spinor built from ξ A ξ A xi^(A)\xi^{A}ξA represents a bivector constructed out of the two 4 -vectors x x x\boldsymbol{x}x and y y y\boldsymbol{y}y. Of these, the first is the "real null vector of the flagpole," already seen to be determined uniquely by the spinor ξ A ξ A xi^(A)\xi^{A}ξA. The second vector,
(41.90) y α Y A U ˙ = ξ A η ¯ U ˙ + η A ξ ¯ U ˙ , (41.90) y α Y A U ˙ = ξ A η ¯ U ˙ + η A ξ ¯ U ˙ , {:(41.90)y^(alpha)longrightarrowY^(AU^(˙))=xi^(A) bar(eta)U^(˙)+eta^(A) bar(xi)U^(˙)",":}\begin{equation*} y^{\alpha} \longrightarrow Y^{A \dot{U}}=\xi^{A} \bar{\eta} \dot{U}+\eta^{A} \bar{\xi} \dot{U}, \tag{41.90} \end{equation*}(41.90)yαYAU˙=ξAη¯U˙+ηAξ¯U˙,
is also determined by ξ A ξ A xi^(A)\xi^{A}ξA, but not uniquely, because the "spinor mate," η A η A eta^(A)\eta^{A}ηA, to ξ A ξ A xi^(A)\xi^{A}ξA is not unique. Go from one choice of mate, η A η A eta^(A)\eta^{A}ηA, to a new choice of mate (equation 41.85),
(41.91) η new A = η A + α ξ A . (41.91) η new  A = η A + α ξ A . {:(41.91)eta_("new ")^(A)=eta^(A)+alphaxi^(A).:}\begin{equation*} \eta_{\text {new }}^{A}=\eta^{A}+\alpha \xi^{A} . \tag{41.91} \end{equation*}(41.91)ηnew A=ηA+αξA.
Then the real 4 -vector y μ y μ y^(mu)y^{\mu}yμ goes to the new real 4 -vector
(41.92) y new μ = y μ + ( α + α ¯ ) x μ (41.92) y new μ = y μ + ( α + α ¯ ) x μ {:(41.92)y_(new)^(mu)=y^(mu)+(alpha+ bar(alpha))x^(mu):}\begin{equation*} y_{\mathrm{new}}^{\mu}=y^{\mu}+(\alpha+\bar{\alpha}) x^{\mu} \tag{41.92} \end{equation*}(41.92)ynewμ=yμ+(α+α¯)xμ
Were the 4 -vector y y y\boldsymbol{y}y unique, there would project out from the flagpole, not a flag but an arrow. The range of values open for the real constant α + α ¯ α + α ¯ alpha+ bar(alpha)\alpha+\bar{\alpha}α+α¯ makes one arrow into many arrows, all coplanar; hence the flag of Penrose. Otherwise stated, the choice of a spinor ξ A ξ A xi^(A)\xi^{A}ξA fixes no individual arrow, but does fix the totality of the collection of arrows, and thus uniquely specifies the flag.
The 4-vector y y y\boldsymbol{y}y (and with it y new y new  y_("new ")\boldsymbol{y}_{\text {new }}ynew  ) is orthogonal to the null 4-vector x x x\boldsymbol{x}x,
(41.93) x y = x β y β = 1 2 X A U ˙ Y A U ˙ = 1 2 ξ A ξ ¯ U ˙ ( ξ ξ A η ˙ U ˙ η A ξ ¯ U ˙ ) = 0 , $ (41.93) x y = x β y β = 1 2 X A U ˙ Y A U ˙ = 1 2 ξ A ξ ¯ U ˙ ξ ξ A η ˙ U ˙ η A ξ ¯ U ˙ = 0 , $ {:(41.93){:[x*y,=x_(beta)y^(beta)=-(1)/(2)X_(AU^(˙))Y^(AU^(˙))],[,=-(1)/(2)xi_(A) bar(xi)_(U^(˙))(xixi^(A)(eta^(˙))(U^(˙)):}]eta^(A) bar(xi)^(U^(˙)))=0","$:}\left.\begin{array}{rl} \boldsymbol{x} \cdot \boldsymbol{y} & =x_{\beta} y^{\beta}=-\frac{1}{2} X_{A \dot{U}} Y^{A \dot{U}} \\ & =-\frac{1}{2} \xi_{A} \bar{\xi}_{\dot{U}}\left(\xi \xi^{A} \dot{\eta} \dot{U}\right. \tag{41.93} \end{array} \eta^{A} \bar{\xi}^{\dot{U}}\right)=0, ~ \$(41.93)xy=xβyβ=12XAU˙YAU˙=12ξAξ¯U˙(ξξAη˙U˙ηAξ¯U˙)=0, $
and spacelike,
v y = 1 2 ( ξ A η ¯ U ˙ + η A ξ ¯ U ˙ ) ( ξ A η ¯ U ˙ + η A ξ ¯ U ˙ ) (41.94) = 1 2 ( ξ A η A ) ( η ¯ U ˙ ξ ¯ U ˙ ) 1 2 ( η A ξ A ) ( ξ ¯ U ˙ η ¯ U ˙ ) = 1 v y = 1 2 ξ A η ¯ U ˙ + η A ξ ¯ U ˙ ξ A η ¯ U ˙ + η A ξ ¯ U ˙ (41.94) = 1 2 ξ A η A η ¯ U ˙ ξ ¯ U ˙ 1 2 η A ξ A ξ ¯ U ˙ η ¯ U ˙ = 1 {:[v*y=-(1)/(2)(xi_(A) bar(eta)_(U^(˙))+eta_(A) bar(xi)_(U^(˙)))(xi^(A) bar(eta)^(U^(˙))+eta^(A) bar(xi)^(U^(˙)))],[(41.94)=-(1)/(2)(xi_(A)eta^(A))( bar(eta)_(U^(˙)) bar(xi)^(U^(˙)))-(1)/(2)(eta_(A)xi^(A))( bar(xi)_(U^(˙)) bar(eta)^(U^(˙)))=1]:}\begin{align*} \boldsymbol{v} \cdot \boldsymbol{y} & =-\frac{1}{2}\left(\xi_{A} \bar{\eta}_{\dot{U}}+\eta_{A} \bar{\xi}_{\dot{U}}\right)\left(\xi^{A} \bar{\eta}^{\dot{U}}+\eta^{A} \bar{\xi}^{\dot{U}}\right) \\ & =-\frac{1}{2}\left(\xi_{A} \eta^{A}\right)\left(\bar{\eta}_{\dot{U}} \bar{\xi}^{\dot{U}}\right)-\frac{1}{2}\left(\eta_{A} \xi^{A}\right)\left(\bar{\xi}_{\dot{U}} \bar{\eta}^{\dot{U}}\right)=1 \tag{41.94} \end{align*}vy=12(ξAη¯U˙+ηAξ¯U˙)(ξAη¯U˙+ηAξ¯U˙)(41.94)=12(ξAηA)(η¯U˙ξ¯U˙)12(ηAξA)(ξ¯U˙η¯U˙)=1
("unit length of flag").
Multiplication of the spinor ξ A ξ A xi^(A)\xi^{A}ξA by the "flag factor" e i σ e i σ e^(i sigma)e^{i \sigma}eiσ rotates the flag about the flagpole by the angle 2 σ 2 σ 2sigma2 \sigma2σ, because the spinor mate, η A η A eta^(A)\eta^{A}ηA, of ξ A ξ A xi^(A)\xi^{A}ξA is multiplied by the factor e i σ e i σ e^(-i sigma)e^{-i \sigma}eiσ [see the normalization condition (41.81)]. These changes alter the vector y y y\boldsymbol{y}y to a rotated vector y rot y rot  y_("rot ")\boldsymbol{y}_{\text {rot }}yrot , with
y rot α Y rot A O ˙ = e 2 i σ ξ A η ¯ U ˙ + e 2 i σ η A ξ ¯ U ˙ = cos 2 σ ( ξ A η ¯ U ˙ + η A ξ ¯ U ˙ ) + sin 2 σ ( i ξ A η ¯ U ˙ i η A ξ ¯ U ˙ ) (41.95) y α cos 2 σ + z α sin 2 σ . y rot α Y rot A O ˙ = e 2 i σ ξ A η ¯ U ˙ + e 2 i σ η A ξ ¯ U ˙ = cos 2 σ ξ A η ¯ U ˙ + η A ξ ¯ U ˙ + sin 2 σ i ξ A η ¯ U ˙ i η A ξ ¯ U ˙ (41.95) y α cos 2 σ + z α sin 2 σ . {:[y_(rot)^(alpha)longrightarrowY_(rot)^(AO^(˙))=e^(2i sigmaxi^(A) bar(eta)^(U^(˙))+e^(-2i sigma)eta^(A) bar(xi)^(U^(˙)))],[=cos 2sigma(xi^(A) bar(eta)^(U^(˙))+eta^(A) bar(xi)^(U^(˙)))+sin 2sigma(ixi^(A) bar(eta)^(U^(˙))-ieta^(A) bar(xi)^(U^(˙)))],[(41.95) longrightarrowy^(alpha)cos 2sigma+z^(alpha)sin 2sigma.]:}\begin{align*} y_{\mathrm{rot}}^{\alpha} \longrightarrow Y_{\mathrm{rot}}^{A \dot{O}}= & e^{2 i \sigma \xi^{A} \bar{\eta}^{\dot{U}}+e^{-2 i \sigma} \eta^{A} \bar{\xi}^{\dot{U}}} \\ = & \cos 2 \sigma\left(\xi^{A} \bar{\eta}^{\dot{U}}+\eta^{A} \bar{\xi}^{\dot{U}}\right)+\sin 2 \sigma\left(i \xi^{A} \bar{\eta}^{\dot{U}}-i \eta^{A} \bar{\xi}^{\dot{U}}\right) \\ & \longrightarrow y^{\alpha} \cos 2 \sigma+z^{\alpha} \sin 2 \sigma . \tag{41.95} \end{align*}yrotαYrotAO˙=e2iσξAη¯U˙+e2iσηAξ¯U˙=cos2σ(ξAη¯U˙+ηAξ¯U˙)+sin2σ(iξAη¯U˙iηAξ¯U˙)(41.95)yαcos2σ+zαsin2σ.
Here the 4 -vector z z z\boldsymbol{z}z shares with the vector y y y\boldsymbol{y}y these properties: it is (1) real, (2) spacelike, (3) of unit magnitude, (4) orthogonal to the null 4 -vector x x x\boldsymbol{x}x of the flagpole, and (5) uniquely specified by the original spinor ξ A ξ A xi^(A)\xi^{A}ξA up to the additive real
Rotation of flag about flagpole
Equations relating spinor, flagpole, and flag
multiple ( α + α ¯ ) ( α + α ¯ ) (alpha+ bar(alpha))(\alpha+\bar{\alpha})(α+α¯) of x x x\boldsymbol{x}x. In addition, z z z\boldsymbol{z}z and y y y\boldsymbol{y}y are orthogonal. Thus y y y\boldsymbol{y}y and z z z\boldsymbol{z}z provide basis vectors in the two-dimensional space in which-to overpictorialize-the "tip of the flag" undergoes its rotation.
Recapitulate by returning to the laser pulse. Two numbers, such as the familiar polar angles θ θ theta\thetaθ (angle with the z z zzz-axis) and ϕ ϕ phi\phiϕ (azimuth around z z zzz-axis from x x xxx-axis) tell the direction of its flight. A third number, r r rrr, gives the distance to the moon and also the travel time for light to reach the moon. A fourth number, an angle ψ ψ psi\psiψ, tells the azimuth of the illuminated arrow shot onto the surface of the moon, this azimuth to be measured from the e θ e θ e_(theta)e_{\theta}eθ direction (where ψ = 0 ψ = 0 psi=0\psi=0ψ=0 ), around the flagpole in a righthanded sense. Then the spinor associated with the flagpole plus flag (rotated arrow) is
(41.96) ( ξ 1 ξ 2 ) = ( 2 r ) 1 / 2 ( cos ( θ / 2 ) e i ϕ / 2 + i ψ / 2 sin ( θ / 2 ) e i ϕ / 2 + i ψ / 2 ) (41.96) ( ξ 1 ξ 2 ) = ( 2 r ) 1 / 2 ( cos ( θ / 2 ) e i ϕ / 2 + i ψ / 2 sin ( θ / 2 ) e i ϕ / 2 + i ψ / 2 ) {:(41.96)((xi^(1))/(xi^(2)))=(2r)^(1//2)((cos(theta//2)e^(-i phi//2+i psi//2))/(sin(theta//2)e^(i phi//2+i psi//2))):}\begin{equation*} \binom{\xi^{1}}{\xi^{2}}=(2 r)^{1 / 2}\binom{\cos (\theta / 2) e^{-i \phi / 2+i \psi / 2}}{\sin (\theta / 2) e^{i \phi / 2+i \psi / 2}} \tag{41.96} \end{equation*}(41.96)(ξ1ξ2)=(2r)1/2(cos(θ/2)eiϕ/2+iψ/2sin(θ/2)eiϕ/2+iψ/2)
according to the conventions adopted here [see (41.87)]. The mate η A η A eta_(A)\eta_{A}ηA to this spinor, unique up to an additive multiple of ξ A ξ A xi^(A)\xi^{A}ξA, is
(41.97) ( η 1 η 2 ) = ( 2 r ) 1 / 2 ( sin ( θ / 2 ) e i ϕ / 2 i ψ / 2 cos ( θ / 2 ) e i ϕ / 2 i ψ / 2 ) . (41.97) ( η 1 η 2 ) = ( 2 r ) 1 / 2 ( sin ( θ / 2 ) e i ϕ / 2 i ψ / 2 cos ( θ / 2 ) e i ϕ / 2 i ψ / 2 ) . {:(41.97)((eta^(1))/(eta^(2)))=(2r)^(-1//2)((-sin(theta//2)e^(-i phi//2-i psi//2))/(cos(theta//2)e^(i phi//2-i psi//2))).:}\begin{equation*} \binom{\eta^{1}}{\eta^{2}}=(2 r)^{-1 / 2}\binom{-\sin (\theta / 2) e^{-i \phi / 2-i \psi / 2}}{\cos (\theta / 2) e^{i \phi / 2-i \psi / 2}} . \tag{41.97} \end{equation*}(41.97)(η1η2)=(2r)1/2(sin(θ/2)eiϕ/2iψ/2cos(θ/2)eiϕ/2iψ/2).
The 4 -vector of the flagpole determined by ξ 4 ξ 4 xi^(4)\xi^{4}ξ4 is found from (41.87):
(41.98) ( x 0 x 1 x 2 x 3 ) = ( r r sin θ cos ϕ r sin θ sin ϕ r cos θ ) . (41.98) x 0 x 1 x 2 x 3 = r r sin θ cos ϕ r sin θ sin ϕ r cos θ . {:(41.98)([x^(0)],[x^(1)],[x^(2)],[x^(3)])=([r],[r sin theta cos phi],[r sin theta sin phi],[r cos theta]).:}\left(\begin{array}{l} x^{0} \tag{41.98}\\ x^{1} \\ x^{2} \\ x^{3} \end{array}\right)=\left(\begin{array}{c} r \\ r \sin \theta \cos \phi \\ r \sin \theta \sin \phi \\ r \cos \theta \end{array}\right) .(41.98)(x0x1x2x3)=(rrsinθcosϕrsinθsinϕrcosθ).
To determine the flag itself, one requires, in addition to x α x α x^(alpha)x^{\alpha}xα, the unit spacelike 4-vector y α y α y^(alpha)y^{\alpha}yα, normal to x α x α x^(alpha)x^{\alpha}xα, and unique up to an additive real multiple of x α x α x^(alpha)x^{\alpha}xα. This vector is evaluated by use of (41.90) and has the form
(41.99) ( y 0 y 1 y 2 y 3 ) = ( 0 cos θ cos ϕ cos ψ + sin ϕ sin ψ cos θ sin ϕ cos ψ cos ϕ sin ψ sin θ cos ψ ) . (41.99) y 0 y 1 y 2 y 3 = 0 cos θ cos ϕ cos ψ + sin ϕ sin ψ cos θ sin ϕ cos ψ cos ϕ sin ψ sin θ cos ψ . {:(41.99)([y^(0)],[y^(1)],[y^(2)],[y^(3)])=([0],[cos theta cos phi cos psi+sin phi sin psi],[cos theta sin phi cos psi-cos phi sin psi],[-sin theta cos psi]).:}\left(\begin{array}{c} y^{0} \tag{41.99}\\ y^{1} \\ y^{2} \\ y^{3} \end{array}\right)=\left(\begin{array}{c} 0 \\ \cos \theta \cos \phi \cos \psi+\sin \phi \sin \psi \\ \cos \theta \sin \phi \cos \psi-\cos \phi \sin \psi \\ -\sin \theta \cos \psi \end{array}\right) .(41.99)(y0y1y2y3)=(0cosθcosϕcosψ+sinϕsinψcosθsinϕcosψcosϕsinψsinθcosψ).
From these expressions for x μ x μ x^(mu)x^{\mu}xμ and y μ y μ y^(mu)y^{\mu}yμ, one calculates the components of the bivector ("flag") F μ ν = x μ y ν y μ x ν F μ ν = x μ y ν y μ x ν F^(mu nu)=x^(mu)y^(nu)-y^(mu)x^(nu)F^{\mu \nu}=x^{\mu} y^{\nu}-y^{\mu} x^{\nu}Fμν=xμyνyμxν by simple arithmetic.

§41.10. APPEARANCE OF THE NIGHT SKY: AN APPLICATION OF SPINORS

Attention has gone here to extracting all four pieces of information contained in a spinor: separation in time (equal to separation in space), direction in space, and
Figure 41.8.
Representation of a direction in space (one of the stars of the Big Dipper, regarded as a point on the celestial sphere) as a point in the complex ζ ζ zeta\zetaζ plane ( ζ = ( ζ = (zeta=(\zeta=(ζ= ratio ξ 2 / ξ 1 ξ 2 / ξ 1 xi^(2)//xi^(1)\xi^{2} / \xi^{1}ξ2/ξ1 of spinor components) by stereographic projection from the South Pole.
rotation about that direction. Turn now to an application where not all that information is needed. Look at the night sky and ask (1) how to describe its appearance and (2) how to change that appearance. As one way to describe its appearance, give the direction of each star. Abandon any concern about the distance of the star, and any concern about any rotation ψ ψ psi\psiψ about the flagpole. In other words, the complex factor
( 2 r ) 1 / 2 e i ψ / 2 ( 2 r ) 1 / 2 e i ψ / 2 (2r)^(1//2)e^(i psi//2)(2 r)^{1 / 2} e^{i \psi / 2}(2r)1/2eiψ/2
common to ξ 1 ξ 1 xi^(1)\xi^{1}ξ1 and ξ 2 ξ 2 xi^(2)\xi^{2}ξ2 drops from attention. All that is left as significant is the ratio ζ ζ zeta\zetaζ of these spinor components:
(41.100) ζ = ξ 2 / ξ 1 = tan ( θ / 2 ) e i ϕ . (41.100) ζ = ξ 2 / ξ 1 = tan ( θ / 2 ) e i ϕ . {:(41.100)zeta=xi^(2)//xi^(1)=tan(theta//2)e^(i phi).:}\begin{equation*} \zeta=\xi^{2} / \xi^{1}=\tan (\theta / 2) e^{i \phi} . \tag{41.100} \end{equation*}(41.100)ζ=ξ2/ξ1=tan(θ/2)eiϕ.
To give the one complex number ζ ζ zeta\zetaζ ("stereographic coordinate;" Figure 41.8) for each star in the sky is to catalog the pattern of the stars.
Let the observer change his stance. The celestial sphere appears to rotate. Or let him rocket past his present location in the direction of the North Star with some substantial fraction of the velocity of light. To him all that portion of the celestial sphere is opened out, as if by a magnifying glass. To compensate, the remaining stars are packed into a smaller angular compass. Any such rotation or boost or combination of rotation and boost being described in spinor language by a transformation of the form
(41.101) ξ A ξ new A = L B A ξ B B , (41.101) ξ A ξ new  A = L B A ξ B B , {:(41.101)xi^(A)longrightarrowxi_("new ")^(A)=L_(B)^(A)xi_(B)^(B)",":}\begin{equation*} \xi^{A} \longrightarrow \xi_{\text {new }}^{A}=L_{B}^{A} \xi_{B}^{B}, \tag{41.101} \end{equation*}(41.101)ξAξnew A=LBAξBB,
Spinors used to analyze "Lorentz transformations" of appearance of night sky
implies a transformation of the complex stereographic coordinate of any given star of the form
(41.102) ζ ζ new = ξ new 2 ξ new 1 = L 2 2 ζ + L 2 L 2 1 ζ + L 1 1 (41.102) ζ ζ new  = ξ new  2 ξ new  1 = L 2 2 ζ + L 2 L 2 1 ζ + L 1 1 {:(41.102)zeta longrightarrowzeta_("new ")=(xi_("new ")^(2))/(xi_("new ")^(1))=(L_(2)^(2)zeta+L^(2))/(L_(2)^(1)zeta+L_(1)^(1)):}\begin{equation*} \zeta \longrightarrow \zeta_{\text {new }}=\frac{\xi_{\text {new }}^{2}}{\xi_{\text {new }}^{1}}=\frac{L_{2}^{2} \zeta+L^{2}}{L_{2}^{1} \zeta+L_{1}^{1}} \tag{41.102} \end{equation*}(41.102)ζζnew =ξnew 2ξnew 1=L22ζ+L2L21ζ+L11
In the special case of a boost in the z z zzz-direction with velocity parameter α α alpha\alphaα (velocity β = tanh α β = tanh α beta=tanh alpha\beta=\tanh \alphaβ=tanhα ), the off-diagonal components L 1 2 L 1 2 L^(1)_(2)L^{1}{ }_{2}L12 and L 2 1 L 2 1 L^(2)_(1)L^{2}{ }_{1}L21 vanish. The magnification of the overhead sky then expresses itself in the simple formula
ζ new = e α ζ ζ new = e α ζ zeta_(new)=e^(alpha)zeta\zeta_{\mathrm{new}}=e^{\alpha} \zetaζnew=eαζ
or
ϕ new = ϕ (41.103) tan ( θ new / 2 ) = e α tan ( θ / 2 ) ϕ new  = ϕ (41.103) tan θ new  / 2 = e α tan ( θ / 2 ) {:[phi_("new ")=phi],[(41.103)tan(theta_("new ")//2)=e^(alpha)tan(theta//2)]:}\begin{gather*} \phi_{\text {new }}=\phi \\ \tan \left(\theta_{\text {new }} / 2\right)=e^{\alpha} \tan (\theta / 2) \tag{41.103} \end{gather*}ϕnew =ϕ(41.103)tan(θnew /2)=eαtan(θ/2)
Contrary to this prediction and false expectation, no magnification at all is achieved of the regions around the North Star by moving with high velocity in that direction. On the contrary, any photon coming in from a star a little off that direction, with a little transverse momentum, keeps that transverse momentum in the new frame; but its longitudinal momentum against the observer is augmented by his motion. Thus the ratio of the momenta is decreased, and the observed angle relative to the North Star is also decreased. The consequence is not magnification, but diminution ("looking through the wrong end of a telescope"). The correct formula is not (41.103) but
(41.104) tan ( θ new / 2 ) = e α tan ( θ / 2 ) (41.104) tan θ new  / 2 = e α tan ( θ / 2 ) {:(41.104)tan(theta_("new ")//2)=e^(-alpha)tan(theta//2):}\begin{equation*} \tan \left(\theta_{\text {new }} / 2\right)=e^{-\alpha} \tan (\theta / 2) \tag{41.104} \end{equation*}(41.104)tan(θnew /2)=eαtan(θ/2)
(reversal of the sign of α α alpha\alphaα ). The reason for this correction is not far to seek. The spinor analysis so far had dealt with an outgoing light pulse, and a 4 -vector with positive time component. That feature was built into the formula adopted to tie the spinor to the 4 -vector,
(41.105) r 1 + ( r σ ) X = ξ A ξ ¯ U ˙ . (41.105) r 1 + ( r σ ) X = ξ A ξ ¯ U ˙ . {:(41.105)r1+(r*sigma)-=X=||xi^(A) bar(xi)^(U^(˙))||.:}\begin{equation*} r \mathbf{1}+(\boldsymbol{r} \cdot \boldsymbol{\sigma}) \equiv X=\left\|\xi^{\boldsymbol{A}} \bar{\xi}^{\dot{U}}\right\| . \tag{41.105} \end{equation*}(41.105)r1+(rσ)X=ξAξ¯U˙.
In contrast the 4 -vector that reaches back to the origin of an incoming photon has a time component that is negative (or, alternatively, sign-reversed space components)! For any null 4 -vector with negative time component, one employs instead of (41.105) the formula
(41.106) X = ξ ξ A ξ ¯ U ˙ . (41.106) X = ξ ξ A ξ ¯ U ˙ . {:(41.106)X=-||xixi^(A) bar(xi)U^(˙)_(||).:}\begin{equation*} X=-\| \xi \xi^{A} \bar{\xi} \dot{U}_{\|} . \tag{41.106} \end{equation*}(41.106)X=ξξAξ¯U˙.
It is enough to mention here this point of principle without going through the details that give the altered sign for α α alpha\alphaα in (41.104). From now on, to preserve the previous arithmetic, change the problem. Deal, not with incoming photons, but with outgoing photons. Replace the receiving telescope by the projector of a planetarium. It projects out into space a separate beam of light for each star of the Big Dipper and also one for the North Star itself. Let an observer move in the positive z z zzz-direction with velocity parameter α α alpha\alphaα. In his frame of reference the beams actually will be widened out in full accord with (41.103).
"The magnification process changes the size of the Big Dipper but not its shape."
This statement is at the same time true and false. It is true of the Dipper and of any other constellation to the extent that the angular dimensions of that constellation can be idealized to be small compared to the entire compass of the sky. It is false in the sense that any well-rounded projected constellation, however small it may appear to an observer at rest with respect to the earth, can always be so "opened out" by the observer putting on any sufficiently high velocity, the observer still being near the earth, that the constellation encompasses a major fraction of the sky.
That the "Lorentz-transformation-induced magnification" of a small object does not change its shape can be seen in three ways. (1) Stereographic projection (Figure 41.8) and "fractional linear transformation" (41.102) are both known to leave all angles unchanged ["conformal invariance;" see for example Penrose (1959)] and known even to turn every old circle into a new circle. (2) Consider a given star, M M MMM, in the constellation and immediate neighbors, L L LLL and N N NNN, just below it and just above it in the count of the members of that constellation. Consider the flagpole pointed at M M MMM and the flag pointed first from M M MMM to L L LLL, then from M M MMM to N N NNN. The flag has turned about the flagpole through an angle ψ ψ psi\psiψ. The two corresponding spinors therefore differ by a phase factor e i ψ / 2 e i ψ / 2 e^(i psi//2)e^{i \psi / 2}eiψ/2. They differ in no other way. After an arbitrary Lorentz transformation they still differ by the phase factor e i ψ / 2 e i ψ / 2 e^(i psi//2)e^{i \psi / 2}eiψ/2, and in no other way. The angle between the arcs M L M L MLM LML and M N M N MNM NMN on the celestial sphere therefore remains at its original value ψ ψ psi\psiψ after the Lorentz transformation (again conformal invariance of patterns on the celestial sphere!). (3) An even more elementary calculation shows that infinitesimal arc lengths on the unit celestial sphere in the direction of increasing θ θ theta\thetaθ and arc lengths in the direction of increasing ϕ ϕ phi\phiϕ are magnified in the same proportion, thus leaving unchanged the angle between arc and arc (conformal invariance). Thus, consider a photon shot out from the planetarium projector to a point on the celestial sphere ("planetarium version of a Big-Dipper star") with inclination θ θ theta\thetaθ to the z z zzz-axis, as seen by an observer at rest relative to the earth. From the standard laws of transformation of angles in a Lorentz transformation ("aberration"; Box 2.4), one has for the sine of the transformed angle
(41.107) sin θ new = ( 1 β 2 ) 1 / 2 1 β cos θ sin θ (41.107) sin θ new  = 1 β 2 1 / 2 1 β cos θ sin θ {:(41.107)sin theta_("new ")=((1-beta^(2))^(1//2))/(1-beta cos theta)sin theta:}\begin{equation*} \sin \theta_{\text {new }}=\frac{\left(1-\beta^{2}\right)^{1 / 2}}{1-\beta \cos \theta} \sin \theta \tag{41.107} \end{equation*}(41.107)sinθnew =(1β2)1/21βcosθsinθ
and (by differentiating the expression for the cosine of the transformed angle)
(41.108) d θ new = ( 1 β 2 ) 1 / 2 1 β cos θ d θ (41.108) d θ new  = 1 β 2 1 / 2 1 β cos θ d θ {:(41.108)dtheta_("new ")=((1-beta^(2))^(1//2))/(1-beta cos theta)d theta:}\begin{equation*} d \theta_{\text {new }}=\frac{\left(1-\beta^{2}\right)^{1 / 2}}{1-\beta \cos \theta} d \theta \tag{41.108} \end{equation*}(41.108)dθnew =(1β2)1/21βcosθdθ
From these expressions it follows at once that the inclination, relative to a meridian line, on the transformed celestial sphere is identical to the direction, relative to the same meridian line, on the original celestial sphere:
tan ( new inclination ) = sin θ new d ϕ new d θ new = sin θ d ϕ d θ (41.109) = tan ( original inclination ) tan (  new   inclination  ) = sin θ new  d ϕ new  d θ new  = sin θ d ϕ d θ (41.109) = tan (  original   inclination  ) {:[tan((" new ")/(" inclination "))=(sin theta_("new ")dphi_("new "))/(dtheta_("new "))=(sin theta d phi)/(d theta)],[(41.109)=tan((" original ")/(" inclination "))]:}\begin{align*} \tan \binom{\text { new }}{\text { inclination }} & =\frac{\sin \theta_{\text {new }} d \phi_{\text {new }}}{d \theta_{\text {new }}}=\frac{\sin \theta d \phi}{d \theta} \\ & =\tan \binom{\text { original }}{\text { inclination }} \tag{41.109} \end{align*}tan( new  inclination )=sinθnew dϕnew dθnew =sinθdϕdθ(41.109)=tan( original  inclination )
Lorentz transformations leave angles on sky unchanged ("conformal invariance")
So much for the elementary spinor and what it has to do with a null vector, with a "flagpole" pointed to the celestial sphere, and with rotation of a "flag" about such a flagpole.

§41.11. SPINORS AS A POWERFUL TOOL IN GRAVITATION THEORY

Spinor formalism in curved spacetime
Spinors needed when analyzing fermions in gravitational fields
Equivalence of spinor and tensor formalisms
Just as vectors, tensors, and differential forms are easily generalized from flat spacetime to curved, so are spinors.
Each event P P P\mathscr{P}P in curved spacetime possesses a tangent space. In that tangent space reside and operate all the vectors, tensors, and forms located at P P P\mathscr{P}P. The geometry of the tangent space is Lorentzian ("local Lorentz geometry at P P P\mathscr{P}P "), since the scalar product of any two vectors u u u\boldsymbol{u}u and v v v\boldsymbol{v}v at P P P\mathscr{P}P, expressed in an orthonormal frame at P P P\mathscr{P}P, is
u v = g ( u , v ) = η α ^ β ^ u α ^ v β ^ u v = g ( u , v ) = η α ^ β ^ u α ^ v β ^ u*v=g(u,v)=eta_( hat(alpha) hat(beta))u^( hat(alpha))v^( hat(beta))\boldsymbol{u} \cdot \boldsymbol{v}=\boldsymbol{g}(\boldsymbol{u}, \boldsymbol{v})=\eta_{\hat{\alpha} \hat{\beta}} u^{\hat{\alpha}} v^{\hat{\beta}}uv=g(u,v)=ηα^β^uα^vβ^
Thus, there is no mathematical difference between the tangent space at P P P\mathscr{P}P on the one hand, and flat spacetime on the other. Whatever mathematical can be done in the one can also be done in the other. In particular, the entire formalism of spinors, developed originally in flat spacetime, can be carried over without change to the tangent space at the arbitrary event P P P\mathscr{P}P in curved spacetime.
Let it be done. Now spinors reside at every event in curved spacetime; and at each event one can translate back and forth between spinor language and tensor language, using the equations (valid in orthonormal frames) of § § 41.6 § § 41.6 §§41.6\S \S 41.6§§41.6 and 41.7.
Spinors in curved spacetime are an indispensible mathematical tool, when one wishes to study the influence of gravity on quantized particles of half-integral spin (neutrinos, electrons, protons, . . .). Consider, for example, Hartle's (1971) proof that a black hole cannot exert any long-range, weak-interaction forces on external matter (i.e., that a black hole has no "weak-interaction hair"). His proof could not function without a spinor description of neutrino fields in curved spacetime. Similarly for Wheeler's (1971b) analysis of the quasibound states of an electron in the gravitational field of a small black hole (gravitational radius 10 13 cm 10 13 cm ∼10^(-13)cm\sim 10^{-13} \mathrm{~cm}1013 cm ): it requires solving the Dirac equation for a spin- 1 2 1 2 (1)/(2)\frac{1}{2}12 particle in the curved spacetime geometry of Schwarzschild. For a detailed discussion of the Dirac equation in curved spacetime see, e.g., Brill and Wheeler (1957).
To use the mathematics of spinors, one need not be dealing with quantum theory or with particles of half-integral spin. The spinor formalism is perfectly applicable in situations where only integral-spin entities (scalars, vectors, tensors) are in view, and where in fact, the spinor formalism is fully equivalent to the tensor formalism that pervades earlier chapters of this book. Equations (41.77) and (41.78) provide the translation from one formalism to the other, once an orthonormal frame has been chosen at each event in spacetime.
Certain types of problems in gravitation theory are far more tractable in the language of spinors than in the language of tensors. Examples are as follows.

(1) Geometric Optics (the theory of "null congruences of geodesics")

Here spinors make almost trivial the lengthy tensor algebra needed in derivations of the "focusing theorem" [equation (22.37)]; and they yield an elegant, simple formalism for discussing and calculating how, with increasing affine parameter, a bundle of rays alters its size ("expansion"), its shape ("shear"), and its orientation ("rotation"). See, e.g., Sachs (1964), Pirani (1965), or Penrose (1968a) for a review and the original references.

(2) Radiation Theory in Curved Spacetime (both gravitational and electromagnetic)

Spinors provide the most powerful of all formalisms for decomposing radiation fields into spherical harmonics and for manipulating their decomposed components. See, for example, Price's (1972a,b) analysis of how a perturbed Schwarzschild black hole radiates away all its radiatable perturbations, be they electromagnetic perturbations, gravitational perturbations, or perturbations in a fictitious field of spin 17; see, similarly, the analysis by Fackerell and Ipser (1972) and by Ipser (1971) of electromagnetic perturbations of a Kerr black hole, and the analysis by Teukolsky (1972) of gravitational perturbations of a Kerr hole. Spinors also yield an elegant and powerful analysis of the " 1 / r 1 / r 1//r1 / r1/r " expansion of a radiation field flowing out from a source into asymptotically flat space. Among its results is a "peeling theorem," which describes the algebraic properties of the coefficients in a 1 / r 1 / r 1//r1 / r1/r expansion of the Riemann tensor. See, e.g., Sachs (1964) or Pirani (1965) for reviews and original references.

(3) Algebraic Properties of Curvature Tensors

The spinor formalism is a more powerful method than any other for deriving the "Petrov-Pirani algebraic classification of the conformal curvature tensor," and for proving theorems about algebraic properties of curvature tensors. See, e.g., Sachs (1964) or Pirani (1965) or Penrose (1968a) for reviews and references.
Of course, the spinor formalism, like any formalism, has its limitations. For example, many of the elementary problems of gravitation theory, and a large fraction of the most difficult ones, would be more difficult in the language of spinors than in the language of tensors! But for certain classes of problems, especially those where null vectors play a central role, spinors are a most valuable tool.
Cartan gave spinors to the world's physics and mathematics. His text (American edition, 1966) is an important reference to the subject.
Applications of spinor formalism in classical gravitation theory

синатен 42

REGGE CALCULUS

This chapter is entirely Track 2. As preparation for it, Chapter 21 (variational principle and initial-value formalism) is needed. It is not needed as preparation for any later chapter, though it will be helpful in Chapter 43 (dynamics of geometry).
The need for Regge calculus as a computational tool

§42.1. WHY THE REGGE CALCULUS?

Gravitation theory is entering an era when situations of greater and greater complexity must be analyzed. Before about 1965 the problems of central interest could mostly be handled by idealizations of special symmetry or special simplicity or both. The Schwarzschild geometry and its generalizations, the Friedmann cosmology and its generalizations, the joining together of the Schwarzschild geometry and the Friedmann geometry to describe the collapse of a bounded collection of matter, the vibrations of relativistic stars, weak gravitational waves propagating in an otherwise flat space: all these problems and others were solved by elementary means.
But today one is pressed to understand situations devoid of symmetry and not amenable to perturbation theory: How do two black holes alter in shape, and how much gravitational radiation do they emit when they collide and coalesce? What are the structures and properties of the singularities at the endpoint of gravitational collapse, predicted by the theorems of Penrose, Hawking, and Geroch? Can a Universe that begins completely chaotic smooth itself out quickly by processes such as inhomogeneous mixmaster oscillations?
To solve such problems, one needs new kinds of mathematical tools-and in response to this need, new tools are being developed. The "global methods" of Chapter 34 provide one set of tools. The Regge Calculus provides another [Regge (1961); see also pp. 467-500 of Wheeler (1964a)].

§42.2. REGGE CALCULUS IN BRIEF

Consider the geodesic dome that covers a great auditorium, made of a multitude of flat triangles joined edge to edge and vertex to vertex. Similarly envisage space-
Approximation of smooth geometries by skeleton structures
time, in the Regge calculus, as made of flat-space "simplexes" (four-dimensional
item in this progression: two dimensions, triangle; three dimensions, tetrahedron; four dimensions, simplex) joined face to face, edge to edge, and vertex to vertex. To specify the lengths of the edges is to give the engineer all he needs in order to know the shape of the roof, and the scientist all he needs in order to know the geometry of the spacetime under consideration. A smooth auditorium roof can be approximated arbitrarily closely by a geodesic dome constructed of sufficiently small triangles. A smooth spacetime manifold can be approximated arbitrarily closely by a locked-together assembly of sufficiently small simplexes. Thus the Regge calculus, reaching beyond ordinary algebraic expressions for the metric, provides a way to analyze physical situations deprived, as so many situations are, of spherical symmetry, and systems even altogether lacking in symmetry.
If the designer can give the roof any shape he pleases, he has more freedom than the analyst who is charting out the geometry of spacetime. Given the geometry of spacetime up to some spacelike slice that, for want of a better name, one may call "now," one has no freedom at all in the geometry from that instant on. Einstein's geometrodynamic law is fully deterministic. Translated into the language of the Regge calculus, it provides a means to calculate the edge lengths of new simplexes from the dimensions of the simplexes that have gone before. Though the geometry is deterministically specified, how it will be approximated is not. The original spacelike hypersurface ("now") is approximated as a collection of tetrahedrons joined together face to face; but how many tetrahedrons there will be and where their vertices will be placed is the option of the analyst. He can endow the skeleton more densely with bones in a region of high curvature than in a region of low curvature to get the most "accuracy profit" from a specified number of points. Some of this freedom of choice for the lengths of the bones remains as one applies the geometrodynamic law in the form given by Regge (1961) to calculate the future from the past. This freedom would be disastrous to any computer program that one tried to write, unless the programmer removed all indefiniteness by adding supplementary conditions of his own choice, either tailored to give good "accuracy profit," or otherwise fixed.
Having determined the lengths of all the bones in the portion of skeletonized spacetime of interest, one can examine any chosen local cluster of bones in and by themselves. In this way one can find out all there is to be learned about the geometry in that region. Of course, the accuracy of one's findings will depend on the fineness with which the skeletonization has been carried out. But in principle that is no limit to the fineness, or therefore to the accuracy, so long as one is working in the context of classical physics. Thus one ends up with a catalog of all the bones, showing the lengths of each. Then one can examine the geometry of whatever spacelike surface one pleases, and look into many other questions besides. For this purpose one has only to pick out the relevant bones and see how they fit together.

§42.3. SIMPLEXES AND DEFICIT ANGLES

Figure 42.1 recalls how a smoothly curved surface can be approximated by flat triangles. All the curvature is concentrated at the vertices. No curvature resides at
Role of Einstein field equation in fixing the skeleton structure
Figure 42.1.
A 2-geometry with continuously varying curvature can be approximated arbitrarily closely by a polyhedron built of triangles, provided only that the number of triangles is made sufficiently great and the size of each sufficiently small. The geometry in each triangle is Euclidean. The curvature of the surface shows up in the amount of deficit angle at each vertex (portion A B C D A B C D ABCDA B C DABCD of polyhedron laid out above on a flat surface).
Deficit angle as a skeletonized measure of curvature:
(1) in two dimensions
(2) in n n nnn (or four) dimensions
the edge between one triangle and the next, despite one's first impression. A vector carried by parallel transport from A A AAA through B B BBB and C C CCC to D D DDD, and then carried back by another route through C C CCC and B B BBB to A A AAA returns to its starting point unchanged in direction, as one sees most easily by laying out this complex of triangles on a flat surface. Only if the route is allowed to encircle the vertex common to A , B , C A , B , C A,B,CA, B, CA,B,C, and D D DDD does the vector experience a net rotation. The magnitude of the rotation is equal to the indicated deficit angle, δ δ delta\deltaδ, at the vertex. The sum of the deficit angles over all the vertices has the same value, 4 π 4 π 4pi4 \pi4π, as does the half-integral of the continuously distributed scalar curvature ( ( 2 ) R = 2 / a 2 ( 2 ) R = 2 / a 2 (^((2))R=2//a^(2):}\left({ }^{(2)} R=2 / a^{2}\right.((2)R=2/a2 for a sphere of radius a a aaa ) taken over the entirety of the original smooth figure,
(42.1) skeleton geometry δ i = 1 2 actual smooth geometry ( 2 ) R d ( surface ) = 4 π . (42.1)  skeleton   geometry  δ i = 1 2  actual smooth   geometry  ( 2 ) R d (  surface  ) = 4 π . {:(42.1)sum_({:[" skeleton "],[" geometry "]:})delta_(i)=(1)/(2)int_({:[" actual smooth "],[" geometry "]:})^((2))Rd(" surface ")=4pi.:}\begin{equation*} \sum_{\substack{\text { skeleton } \\ \text { geometry }}} \delta_{i}=\frac{1}{2} \int_{\substack{\text { actual smooth } \\ \text { geometry }}}{ }^{(2)} R d(\text { surface })=4 \pi . \tag{42.1} \end{equation*}(42.1) skeleton  geometry δi=12 actual smooth  geometry (2)Rd( surface )=4π.
Generalizing from the example of a 2-geometry, Regge calculus approximates a smoothly curved n n nnn-dimensional Riemannian manifold as a collection of n n nnn-dimensional blocks, each free of any curvature at all, joined by ( n 2 ) ( n 2 ) (n-2)(n-2)(n2)-dimensional regions in which all the curvature is concentrated (Box 42.1). For the four-dimensional spacetime of general relativity, the "hinge" at which the curvature is concentrated has the shape of a triangle, as indicated schematically in the bottom row of Figure 42.2. In the example illustrated there, ten tetrahedrons have that triangle in common. Between one of these tetrahedrons and the next fits a four-dimensional simplex. Every feature of this simplex is determined by the lengths of its ten edges. One of the features is the angle α α alpha\alphaα between one of the indicated tetrahedrons or "faces" of the simplex and the next. Thus α α alpha\alphaα represents the angle subtended by this simplex
Box 42.1 THE HINGES WHERE THE CURVATURE IS CONCENTRATED IN THE "ANGLE OF RATTLE" BETWEEN BUILDING BLOCKS IN A SKELETON MANIFOLD
Dimensionality of manifold 2 3 4
Elementary flat-space
building block:
Edge lengths to define it:
Elementary flat-space building block: Edge lengths to define it:| Elementary flat-space | | :--- | | building block: | | Edge lengths to define it: |
triangle tetrahedron simplex
Hinge where cycle of such
blocks meet with a deficit
angle or "angle of rattle" δ : δ : delta:\delta:δ:
Dimensionality of hinge:
"Content" of such a hinge:
Hinge where cycle of such blocks meet with a deficit angle or "angle of rattle" delta: Dimensionality of hinge: "Content" of such a hinge:| Hinge where cycle of such | | :--- | | blocks meet with a deficit | | angle or "angle of rattle" $\delta:$ | | Dimensionality of hinge: | | "Content" of such a hinge: |
3 4 5
Contribution from all hinges
within a given small region
to curvature of manifold:
Contribution from all hinges within a given small region to curvature of manifold:| Contribution from all hinges | | :--- | | within a given small region | | to curvature of manifold: |
0 vertex edge
Dimensionality of manifold 2 3 4 "Elementary flat-space building block: Edge lengths to define it:" triangle tetrahedron simplex "Hinge where cycle of such blocks meet with a deficit angle or "angle of rattle" delta: Dimensionality of hinge: "Content" of such a hinge:" 3 4 5 "Contribution from all hinges within a given small region to curvature of manifold:" 0 vertex edge| Dimensionality of manifold | 2 | 3 | 4 | | :--- | :---: | :---: | :---: | | Elementary flat-space <br> building block: <br> Edge lengths to define it: | triangle | tetrahedron | simplex | | Hinge where cycle of such <br> blocks meet with a deficit <br> angle or "angle of rattle" $\delta:$ <br> Dimensionality of hinge: <br> "Content" of such a hinge: | 3 | 4 | 5 | | Contribution from all hinges <br> within a given small region <br> to curvature of manifold: | 0 | vertex | edge |
Continuum limit of this quantity
expressed as an integral over the same small region:
1 2 ( 2 ) R ( ( 2 ) g ) 1 / 2 d 2 x 1 2 ( 3 ) R ( ( 3 ) g ) 1 / 2 d 3 x 1 2 ( 4 ) R ( ( 4 ) g ) 1 / 2 d 4 x 1 2 ( 2 ) R ( 2 ) g 1 / 2 d 2 x 1 2 ( 3 ) R ( 3 ) g 1 / 2 d 3 x 1 2 ( 4 ) R ( 4 ) g 1 / 2 d 4 x (1)/(2)int^((2))R(^((2))g)^(1//2)d^(2)x quad(1)/(2)int^((3))R(^((3))g)^(1//2)d^(3)x quad(1)/(2)int^((4))R(-^((4))g)^(1//2)d^(4)x\frac{1}{2} \int{ }^{(2)} R\left({ }^{(2)} g\right)^{1 / 2} d^{2} x \quad \frac{1}{2} \int{ }^{(3)} R\left({ }^{(3)} g\right)^{1 / 2} d^{3} x \quad \frac{1}{2} \int{ }^{(4)} R\left(-{ }^{(4)} g\right)^{1 / 2} d^{4} x12(2)R((2)g)1/2d2x12(3)R((3)g)1/2d3x12(4)R((4)g)1/2d4x
at the hinge. Summing the angles α α alpha\alphaα for all the simplexes that meet on the given hinge P R P R PR\mathscr{P} \mathscr{R}PR, and subtracting from 2 π 2 π 2pi2 \pi2π, one gets the deficit angle associated with that hinge. And by then summing the deficit angles in a given small n n nnn-volume with appropriate weighting (Box 42.1), one obtains a number equal to the volume integral of the scalar curvature of the original smooth n n nnn-geometry. See Box 42.2.

§42.4. SKELETON FORM OF FIELD EQUATIONS

Rather than translate Einstein's field equations directly into the language of the skeleton calculus, Regge turns to a standard variational principle from which Einstein's law lets itself be derived. It says (see $ $ 21.2 $ $ 21.2 $$21.2\$ \$ 21.2$$21.2 and 43.3 ) adjust the 4 -geometry throughout an extended region of spacetime, subject to certain specified conditions on the boundary, so that the dimensionless integral (action in units of ! ! ℏ!\hbar!! ),
(42.2) I = ( c 3 / 16 π G ) R ( g ) 1 / 2 d 4 x (42.2) I = c 3 / 16 π G R ( g ) 1 / 2 d 4 x {:(42.2)I=(c^(3)//16 piℏG)int R(-g)^(1//2)d^(4)x:}\begin{equation*} I=\left(c^{3} / 16 \pi \hbar G\right) \int R(-g)^{1 / 2} d^{4} x \tag{42.2} \end{equation*}(42.2)I=(c3/16πG)R(g)1/2d4x
is an extremum. This statement applies when space is free of matter and electromag-
Einstein-Hilbert variational principle reduced to skeleton form
Figure 42.2.
Cycle of building blocks associated with a single hinge. Top row, two dimensions: left, schematic association of vertices S , T , Q , T , W S , T , Q , T , W S,T,Q,T,WS, \mathscr{T}, \mathscr{Q}, \mathscr{T}, \mathscr{W}S,T,Q,T,W with "hinge" at the vertex P P P\mathscr{P}P; right, same, but with elementary triangles indicated in full. Middle row, three dimensions: left, schematic; right, perspective representation of the six tetrahedrons that meet on the "hinge" P P P\mathscr{P}P. Bottom row, four dimensions; shown only
the interior of which space is flat. The five vertices P R Q E P R Q E PRQE\mathscr{P} \mathscr{R} \mathscr{Q} \mathcal{E}PRQE belong to the next simplex; and so on around the cycle of simplexes. The two simplexes just named interface at the tetrahedron P Q Q Q P Q Q Q PQQQ\mathscr{P Q} \mathscr{Q} \mathscr{Q}PQQQ, inside which
angle α α alpha\alphaα subtended at the "hinge" P Q R P Q R PQR\mathscr{P Q R}PQR. The value of this angle is completely fixed by the ten edge
subtended at the hinge P Q R P Q R PQR\mathscr{P Q R}PQR by the other simplexes of the cycle, do not in general add up to 2 π 2 π 2pi2 \pi2π. The deficit, the "angle of rattle" or deficit angle δ δ delta\deltaδ, gives the amount of curvature concentrated at the hinge P P P\mathscr{P}P QR. There is no actual rattle or looseness of fit, unless one tries to imbed the cycle into an over-all flat four-dimensional space (analog of "stamping on" the collection of triangles, and seeing them open out by the amount of the deficit angle, as indicated in inset in Figure 42.1).
netic fields, a simplification that will be made in the subsequent discussion to keep it from becoming too extended. When in addition all lengths are expressed in units of the Planck length
(42.3) L = ( G / c 3 ) 1 / 2 = 1.6 × 10 33 cm , (42.3) L = G / c 3 1 / 2 = 1.6 × 10 33 cm , {:(42.3)L^(**)=(ℏG//c^(3))^(1//2)=1.6 xx10^(-33)cm",":}\begin{equation*} L^{*}=\left(\hbar G / c^{3}\right)^{1 / 2}=1.6 \times 10^{-33} \mathrm{~cm}, \tag{42.3} \end{equation*}(42.3)L=(G/c3)1/2=1.6×1033 cm,
and the curvature integral is approximated by its expression in terms of deficit angles, Regge shows that the statement δ I = 0 δ I = 0 delta I=0\delta I=0δI=0 (condition for an extremum!) becomes
(42.4) ( 1 / 8 π ) δ hinge. h = 1 H A h δ h = 0 . (42.4) ( 1 / 8 π ) δ  hinge.  h = 1 H A h δ h = 0 . {:(42.4)(1//8pi)deltasum_({:[" hinge. "],[h=1]:})^(H)A_(h)delta_(h)=0.:}\begin{equation*} (1 / 8 \pi) \delta \sum_{\substack{\text { hinge. } \\ h=1}}^{H} A_{h} \delta_{h}=0 . \tag{42.4} \end{equation*}(42.4)(1/8π)δ hinge. h=1HAhδh=0.

Box 42.2 FLOW DIAGRAMS FOR REGGE CALCULUS

A skeleton 4-geometry is completely determined by all its edge lengths. From the edge lengths one gets the integrated curvature by pursuing, for each hinge in the 4-geometry, the following flow diagram:
One finds it natural to apply this analysis in either of two ways. First, one can probe a given 4 -geometry (given set of edge lengths!) in the sense

Box 42.2 (continued)

Second-and this is the rationale of Regge calculus-one can use the skeleton calculus to deduce a previously unknown 4-geometry from Einstein's geometrodynamic law, proceeding in the direction
In the changes contemplated in this variational principle, certain edge lengths are thought of as being fixed. They have to do with the conditions specified at the boundaries of the region of spacetime under study. It is not necessary here to enter into the precise formulation of these boundary conditions, fortunately, since some questions of principle still remain to be clarified about the precise formulation of boundary conditions in general relativity (see §21.12). Rather, what is important is the effect of changes in the lengths of the edges of the blocks in the interior of the region being analyzed, as they augment or decrease the deficit angles at the various hinges. In his basic paper on the subject, Regge (1961) notes that the typical deficit angle δ h δ h delta_(h)\delta_{h}δh depends in a complicated trigonometric way on the values of numerous edge lengths p p ℓ_(p)\ell_{p}p. However, he proves (Appendix of his paper) that "quite remarkably, we can carry out the variation as if the δ h δ h delta_(h)\delta_{h}δh were constants," thus reducing the variational principle to the form
(42.5) ( 1 / 8 π ) hinges h i = 1 H δ h δ A h = 0 . (42.5) ( 1 / 8 π )  hinges  h i = 1 H δ h δ A h = 0 . {:(42.5)(1//8pi)sum_({:[" hinges "],[hi=1]:})^(H)delta_(h)deltaA_(h)=0.:}\begin{equation*} (1 / 8 \pi) \sum_{\substack{\text { hinges } \\ h i=1}}^{H} \delta_{h} \delta A_{h}=0 . \tag{42.5} \end{equation*}(42.5)(1/8π) hinges hi=1HδhδAh=0.
Here the change in area of the h h hhh-th triangle-shaped hinge, according to elementary trigonometry, is
(42.6) δ A h = 1 2 p p δ p cotan θ p h . (42.6) δ A h = 1 2 p p δ p cotan θ p h . {:(42.6)deltaA_(h)=(1)/(2)sum_(p)ℓ_(p)deltaℓ_(p)cotantheta_(ph).:}\begin{equation*} \delta A_{h}=\frac{1}{2} \sum_{p} \ell_{p} \delta \ell_{p} \operatorname{cotan} \theta_{p h} . \tag{42.6} \end{equation*}(42.6)δAh=12ppδpcotanθph.
In this equation θ p h θ p h theta_(ph)\theta_{p h}θph is the angle opposite to the p p ppp-th edge in the triangle. Consequently, Einstein's equations in empty space reduce in skeleton geometry to the form
(42.7) hinges that have the give edge p in common δ h cotan θ p h = 0 , ( p = 1 , 2 , ) , (42.7)  hinges that   have the   give edge  p  in common  δ h cotan θ p h = 0 , ( p = 1 , 2 , ) , {:(42.7)sum_({:[" hinges that "],[" have the "],[" give edge "],[p" in common "]:})delta_(h)cotantheta_(ph)=0","quad(p=1","2","dots)",":}\begin{equation*} \sum_{\substack{\text { hinges that } \\ \text { have the } \\ \text { give edge } \\ p \text { in common }}} \delta_{h} \operatorname{cotan} \theta_{p h}=0, \quad(p=1,2, \ldots), \tag{42.7} \end{equation*}(42.7) hinges that  have the  give edge p in common δhcotanθph=0,(p=1,2,),
one equation for each edge length in the interior of the region of spacetime being analyzed.

§42.5. THE CHOICE OF LATTICE STRUCTURE

Two questions arise in the actual application of Regge calculus, and it is not clear that either has yet received the resolution which is most convenient for practical applications of this skeleton analysis: What kind of lattice to use? How best to capitalize on the freedom that exists in the choice of edge lengths? The first question is discussed in this section, the second in the next section.
It might seem most natural to use a lattice made of small, nearly rectangular blocks, the departure of each from rectangularity being conditioned by the amount and directionality of the local curvature. However, such building blocks are "floppy." One could give them rigidity by specifying certain angles as well as the edge lengths. But then one would lose the cleanness of Regge's prescription: give edge lengths, and give only edge lengths, and give each edge length freely and independently, in order to define a geometry. In addition one would have to rederive the Regge equations, including new equations for the determination of the new angles. Therefore one discards the quasirectangle in favor of the simplex with its 5 4 / 2 = 10 5 4 / 2 = 10 5*4//2=105 \cdot 4 / 2=1054/2=10 edge lengths. This decided, one also concludes that even in flat spacetime the simplexes cannot all have identical edge lengths. Two-dimensional flat space can be filled with identical equilateral triangles, but already at three dimensions it ceases to be possible to fill out the manifold with identical equilateral tetrahedrons. One knows that a given carbon atom in diamond is joined to its nearest neighbors with tetrahedral bonds, but a little reflection shows that the cell assignable to the given atom is far from having the shape of an equilateral tetrahedron.
Synthesis would appear to be a natural way to put together the building blocks: first make one-dimensional structures; assemble these into two-dimensional structures; these, into three-dimensional ones; and these, into the final four-dimensional structure. The one-dimensional structure is made of points, 1 , 2 , 3 , 1 , 2 , 3 , 1,2,3,dots1,2,3, \ldots1,2,3,, alternating with line segments, 12 , 23 , 34 , 12 , 23 , 34 , 12,23,34,dots12,23,34, \ldots12,23,34,. To start building a two-dimensional structure, pick up a second one-dimensional structure. It might seem natural to label its points 1 , 2 , 3 , 1 , 2 , 3 , 1^('),2^('),3^('),dots1^{\prime}, 2^{\prime}, 3^{\prime}, \ldots1,2,3,, etc. However, that labeling would imply a cross-connection between 1 and 1 1 1^(')1^{\prime}1, between 2 and 2 2 2^(')2^{\prime}2, etc., after the fashion of a ladder. Then the elementary cells would be quasirectangles. They would have the "floppiness" that is to be excluded. Therefore relabel the points of the second one-dimensional structure as 1 1 2 1 2 , 2 1 2 , 3 1 2 1 1 2 1 2 , 2 1 2 , 3 1 2 1(1)/(2)^((1)/(2)),2(1)/(2)^('),3(1)/(2)^(')1 \frac{1}{2}^{\frac{1}{2}}, 2 \frac{1}{2}^{\prime}, 3 \frac{1}{2}^{\prime}11212,212,312, etc. The implication is that one cross-connects 2 1 2 1 2(1^('))/()2 \frac{1^{\prime}}{}21 with points 2 and 3 of the original one-dimensional structure, etc. One ends up with something like the
Einstein field equation reduced to skeleton form
The choice of lattice structure:
(1) avoiding floppiness
(2) necessity for unequal edge lengths
(3) construction of twodimensional structures

(4) 3-D structures built from 2-D structures by "method of blocks"
(5) 3-D structures from 2-D by "method of spheres"
girder structure of a bridge, fully rigid in the context of two dimensions, as desired. The same construction, extended, fills out the plane with triangles. One now has a simple, standard two-dimensional structure. One might mistakenly conclude that one is ready to go ahead to build up a three-dimensional structure: the mistake lies in the tacit assumption that the flat-space topology is necessarily correct.
Let it be the problem, for example, to determine the development in time of a 3 -geometry that has the topology of a 3 -sphere. This 3 -sphere is perhaps strongly deformed from ideality by long-wavelength gravitational waves. A right arrangement of the points is the immediate desideratum. Therefore put aside for the present any consideration of the deformation of the geometry by the waves (alteration of edge lengths from ideality). Ask how to divide a perfect 3-sphere into two-dimensional sheets. Here each sheet is understood to be separated from the next by a certain distance. At this point two alternative approaches suggest themselves that one can call for brevity "blocks" and "spheres."
(1) Blocks. Note that a 3-sphere lets itself be decomposed into 5 identical, tetra-hedron-like solid blocks ( 5 vertices; 5 ways to leave out any one of these vertices!) Fix on one of these "tetrahedrons." Select one vertex as summit and the face through the other three vertices as base. Give that base the two-dimensional lattice structure already described. Introduce a multitude of additional sheets piled above it as evenly spaced layers reaching to the summit. Each layer has fewer points than the layer before. The decomposition of the 3-geometry inside one "tetrahedron" is thereby accomplished. However, an unresolved question remains; not merely how to join on this layered structure in a regular way to the corresponding structure in the adjacent "tetrahedrons"; but even whether such a regular joinup is at all possible. The same question can be asked about the other two ways to break up the 3-sphere into identical "tetrahedrons" [Coxeter (1948), esp. pp. 292-293: 16 tetrahedrons defined by a total of 8 vertices or 600 tetrahedrons defined by a total of 120 vertices]. One can eliminate the question of joinup of structure in a simple way, but at the price of putting a ceiling on the accuracy attainable: take the stated number of vertices ( 5 or 8 or 120) as the total number of points that will be employed in the skeletonization of the 3 -geometry (no further subdivision required or admitted). Considering the boundedness of the memory capacity of any computer, it is hardly ridiculous to contemplate a limitation to 120 tracer points in exploratory calculations!
(2) Spheres. An alternative approach to the "atomization" of the 3-sphere begins by introducing on the 3 -sphere a North Pole and a South Pole and the hyperspherical angle χ ( χ = 0 χ ( χ = 0 chi(chi=0\chi(\chi=0χ(χ=0 at the first pole, χ = π χ = π chi=pi\chi=\piχ=π at the second, χ = π / 2 χ = π / 2 chi=pi//2\chi=\pi / 2χ=π/2 at the equator; see Box 27.2). Let each two-dimensional layer lie on a surface of constant χ χ chi\chiχ ( χ χ chi\chiχ equal to some integer times some interval Δ χ Δ χ Delta chi\Delta \chiΔχ ). The structure of this 2 -sphere is already to be regarded as skeletonized into elementary triangles ("fully complete Buckminster Fuller geodesic dome"). Therefore the number of "faces" or triangles F F FFF, the number of edge lengths E E EEE, and the number of vertices V V VVV must be connected by the relation of Euler:
(42.8) F E + V = ( a topology-dependent number or "Euler character" ) = { 2 for 2-sphere , 0 for 2-torus. (42.8) F E + V = (  a topology-dependent   number or "Euler character"  ) = 2  for 2-sphere  , 0  for 2-torus.  {:(42.8)F-E+V=((" a topology-dependent ")/(" number or "Euler character" "))={[2" for 2-sphere "","],[0" for 2-torus. "]:}:}F-E+V=\binom{\text { a topology-dependent }}{\text { number or "Euler character" }}=\left\{\begin{array}{l} 2 \text { for 2-sphere }, \tag{42.8}\\ 0 \text { for 2-torus. } \end{array}\right.(42.8)FE+V=( a topology-dependent  number or "Euler character" )={2 for 2-sphere ,0 for 2-torus. 
It follows from this relation that it is impossible for each vertex to sit at the center
of a hexagon (each vertex the point of convergence of 6 triangles). This being the case, one is not astonished that a close inspection of the pattern of a geodesic dome shows several vertices where only 5 triangles meet. It is enough to have 12 such 5 -triangle vertices among what are otherwise all 6 -triangle vertices in order to meet the requirements of the Euler relation:
n 5-triangle vertices V n 6-triangle vertices F = ( V n ) ( 6 / 3 ) + n ( 5 / 3 ) triangles (42.9) E = ( V n ) ( 6 / 2 ) + n ( 5 / 2 ) edges V = ( V n ) ( 6 / 6 ) + n vertices 2 = F E + V = n / 6 Euler characteristic n = 12 n  5-triangle vertices  V n  6-triangle vertices  F = ( V n ) ( 6 / 3 ) + n ( 5 / 3 )  triangles  (42.9) E = ( V n ) ( 6 / 2 ) + n ( 5 / 2 )  edges  V = ( V n ) ( 6 / 6 ) + n  vertices  2 = F E + V = n / 6  Euler characteristic  n = 12 {:[nquad" 5-triangle vertices "],[V-nquad" 6-triangle vertices "],[F=(V-n)(6//3)+n(5//3)" triangles "],[(42.9)E=(V-n)(6//2)+n(5//2)" edges "],[V=(V-n)(6//6)+n" vertices "],[2=F-E+V=n//6quad" Euler characteristic "],[n=12]:}\begin{align*} n & \quad \text { 5-triangle vertices } \\ V-n & \quad \text { 6-triangle vertices } \\ F & =(V-n)(6 / 3)+n(5 / 3) \text { triangles } \\ E & =(V-n)(6 / 2)+n(5 / 2) \text { edges } \tag{42.9}\\ V & =(V-n)(6 / 6)+n \text { vertices } \\ 2=F-E+V & =n / 6 \quad \text { Euler characteristic } \\ n & =12 \end{align*}n 5-triangle vertices Vn 6-triangle vertices F=(Vn)(6/3)+n(5/3) triangles (42.9)E=(Vn)(6/2)+n(5/2) edges V=(Vn)(6/6)+n vertices 2=FE+V=n/6 Euler characteristic n=12
Among all figures with triangular faces, the icosahedron is the one with the smallest number of faces that meets this condition (5-triangle vertices exclusively!)
If each 2 -surface has the pattern of vertices of a geodesic dome, how is one dome to be joined to the next to make a rigid skeleton 3-geometry? Were the domes imbedded in a flat 3-geometry, rigidity would be no issue. Each dome would already be rigid in and by itself. However, the 3 -geometry is not given to be flat. Only by a completely deterministic skeletonization of the space between the two 2 -spheres will they be given rigidity in the context of curved space geometry. (1) Not by running a single connector from each vertex in one surface to the corresponding vertex in the next ("floppy structure"!) (2) Not by displacing one surface so each of its vertices comes above, or nearly above, the center of a triangle in the surface "below." First, the numbers of vertices and triangles ordinarily will not agree. Second, even when they do, it will not give the structure the necessary rigidity to connect the vertex of the surface above to the three vertices of the triangle below. The space between will contain some tetrahedrons, but it will not be throughout decomposed into tetrahedrons. (3) A natural and workable approach to the skeletonization of the 3 -geometry is to run a connector from each vertex in the one surface to the corresponding vertex in the next, but to flesh out this connection with additional structure that will give rigidity to the 3 -geometry: intervening vertices and connectors as illustrated in Box 42.3.
In working up from the skeletonization of a 3-geometry to the skeletonization of a 4-geometry, it is natural to proceed similarly. (1) Use identical patterns of points in the two 3 -geometries. (2) Tie corresponding points together by single connectors. (3) Halfway, or approximately half way between the two 3-geometries insert a whole additional pattern of vertices. Each of these supplementary vertices is "dual" to and lies nearly "below" the center of a tetrahedron in the 3-geometry immediately above. (4) Connect each supplementary vertex to the vertices of the tetrahedron immediately above, to the vertices of the tetrahedron immediately below, and to those other supplementary vertices that are its immediate neighbors. (5) In this way get the edge lengths needed to divide the 4 -geometry into simplexes, each of rigidly defined dimensions.
(6) 4-D structures built from 3-D structures

Box 42.3 SYNTHESIS OF HIGHER-DIMENSIONAL SKELETON GEOMETRIES OUT OF LOWER-DIMENSIONAL SKELETON GEOMETRIES


(1) One-dimensional structure as alternation of points and line segments. (2) Two-dimensional structure (a) "floppy" (unacceptable) and (b) rigidified (angles of triangles fully determined by edge lengths). When this structure is extended, as at right, the "normal" vertex has six triangles hinging on it. However, at least twelve 5 -triangle vertices of the type indicated at a a aaa are to be interpolated if the 2 -geometry is to be able to close up into a 2 -sphere. (3) Skeleton 3 -geometry obtained by filling in between the skeleton 2 -geometry... Q B F P C E D Q B F P C E D QBdotsFPCdotsEDdots\mathscr{Q} \mathscr{B} \ldots \mathscr{F} \mathscr{P C} \ldots \mathscr{E} \mathscr{D} \ldotsQBFPCED and the similar structure a B F P C E D a B F P C E D dotsa^(')B^(')dotsF^(')P^(')C^(')dotsE^(')D^(')dots\ldots \mathscr{a}^{\prime} \mathscr{B}^{\prime} \ldots \mathscr{F}^{\prime} \mathscr{P}^{\prime} \mathcal{C}^{\prime} \ldots \mathcal{E}^{\prime} \mathscr{D}^{\prime} \ldotsaBFPCED as follows. (a) Insert direct connectors such as P P P P PP^(')\mathscr{P} \mathscr{P}^{\prime}PP between corresponding points in the two 2 -geometries. (b) Insert an intermediate layer of "supplementary vertices" such as 5 T V T F . 5 T V T F . 5TVTF.5 \mathscr{T} \mathscr{V} \mathscr{T} \mathscr{F} .5TVTF.. . Each of these supplementary vertices lies roughly halfway between the center of the triangle "above" it and the center of the corresponding triangle "below" it. (c)
Connect each such "supplementary vertex" with its immediate neighbors above, below, and in the same plane. (d) Give all edge lengths. (e) Then the skeleton 3 -geometry between the two 2 -geometries is rigidly specified. It is made up of five types of tetrahedrons, as follows. (1) "Rightthrough blocks," such as P P P S T P P P S T PPP^(')ST\mathscr{P P} \mathscr{P}^{\prime} S \mathscr{T}PPPST (six of these hinge on P P P P PP^(')\mathscr{P} \mathscr{P}^{\prime}PP when P P P\mathscr{P}P is a normal vertex; five, when it is a 5 -fold vertex, such as indicated by a a aaa at the upper right). (2) "Lower-facing blocks," such as Q P T Q P T QPT\mathscr{Q} \mathscr{P} \mathscr{T}QPT. (3) "Lower-packing blocks," such as Q®ST . ( 4 , 5 ) ( 4 , 5 ) (4,5)(4,5)(4,5) Corresponding "upper-facing blocks" and "upper-packing blocks" (not shown). The number of blocks of each kind is appropriately listed here for the two extreme cases of a 2-geometry that consists (a) of a normal hexagonal lattice extending indefinitely in a plane and (b) of a lattice consisting of the minimum number of 5 -fold vertices ("type a a aaa vertices") that will permit closeup into a 2 -sphere.
2-geometry of upper
(or lower) face
2-geometry of upper (or lower) face| 2-geometry of upper | | :--- | | (or lower) face |
Hexagonal pattern
of triangles
Hexagonal pattern of triangles| Hexagonal pattern | | :--- | | of triangles |
Icosahedron
made of triangles
Icosahedron made of triangles| Icosahedron | | :--- | | made of triangles |
Its topology Infinite 2-plane 2-sphere
Vertices on upper face V 12
Nature of these vertices 6 -fold 5 -fold
Edge lengths on upper face 3 V 5 V = 30 5 V = 30 5V=305 \mathrm{~V}=305 V=30
Triangles on upper face 2 V 20
Number of "supplementary vertices" 2 V 20
Outer facing blocks 2 V 20
Outer packing blocks 3 V 30
Right through blocks 6 V 60
Inner packing blocks 3 V 30
Inner facing blocks 2 V 20
"2-geometry of upper (or lower) face" "Hexagonal pattern of triangles" "Icosahedron made of triangles" Its topology Infinite 2-plane 2-sphere Vertices on upper face V 12 Nature of these vertices 6 -fold 5 -fold Edge lengths on upper face 3 V 5V=30 Triangles on upper face 2 V 20 Number of "supplementary vertices" 2 V 20 Outer facing blocks 2 V 20 Outer packing blocks 3 V 30 Right through blocks 6 V 60 Inner packing blocks 3 V 30 Inner facing blocks 2 V 20| 2-geometry of upper <br> (or lower) face | Hexagonal pattern <br> of triangles | Icosahedron <br> made of triangles | | :--- | :---: | :---: | | Its topology | Infinite 2-plane | 2-sphere | | Vertices on upper face | V | 12 | | Nature of these vertices | 6 -fold | 5 -fold | | Edge lengths on upper face | 3 V | $5 \mathrm{~V}=30$ | | Triangles on upper face | 2 V | 20 | | Number of "supplementary vertices" | 2 V | 20 | | Outer facing blocks | 2 V | 20 | | Outer packing blocks | 3 V | 30 | | Right through blocks | 6 V | 60 | | Inner packing blocks | 3 V | 30 | | Inner facing blocks | 2 V | 20 |

§42.6. THE CHOICE OF EDGE LENGTHS

So much for the lattice structure of the 4-geometry; now for the other issue, the freedom that exists in the choice of edge lengths. Why not make the simplest choice and let all edges be light rays? Because the 4 -geometry would not then be fully determined. The geometry g α β ( x μ ) g α β x μ g_(alpha beta)(x^(mu))g_{\alpha \beta}\left(x^{\mu}\right)gαβ(xμ) differs from the geometry λ ( x μ ) g α β ( x μ ) λ x μ g α β x μ lambda(x^(mu))g_(alpha beta)(x^(mu))\lambda\left(x^{\mu}\right) g_{\alpha \beta}\left(x^{\mu}\right)λ(xμ)gαβ(xμ), even though the same points that are connected by light rays in the one geometry are also connected by light rays in the other geometry.
If none of the edges is null, it is nevertheless natural to take some of the edge lengths to be spacelike and some to be timelike. In consequence the area A A AAA of the triangle in some cases will be real, in other cases imaginary. In 3-space the parallelogram (double triangle) spanned by two vectors B B B\boldsymbol{B}B and C C C\boldsymbol{C}C is described by a vector
2 A = B × C 2 A = B × C 2A=B xx C2 A=B \times C2A=B×C
perpendicular to the two vectors. One obtains the magnitude of A A A\boldsymbol{A}A from the formula
4 A 2 = B 2 C 2 ( B C ) 2 4 A 2 = B 2 C 2 ( B C ) 2 4A^(2)=B^(2)C^(2)-(B*C)^(2)4 A^{2}=B^{2} C^{2}-(\boldsymbol{B} \cdot \boldsymbol{C})^{2}4A2=B2C2(BC)2
In 4-space, let B B B\boldsymbol{B}B and C C C\boldsymbol{C}C be two edges of the triangle. Then, as in three dimensions, 2 A 2 A 2A2 \boldsymbol{A}2A is dual to the bivector built from B B B\boldsymbol{B}B and C C C\boldsymbol{C}C. In other words, if B B B\boldsymbol{B}B goes in the t t ttt direction and C C C\boldsymbol{C}C in the z z zzz direction, then A A A\boldsymbol{A}A is a bivector lying in the ( x , y ) ( x , y ) (x,y)(x, y)(x,y) plane. Consequently its magnitude A A AAA is to be thought of as a real quantity. Therefore the appropriate formula for the area A A AAA is (Tullio Regge)
(42.10) 4 A 2 = ( B C ) 2 B 2 C 2 (42.10) 4 A 2 = ( B C ) 2 B 2 C 2 {:(42.10)4A^(2)=(B*C)^(2)-B^(2)C^(2):}\begin{equation*} 4 A^{2}=(\boldsymbol{B} \cdot \boldsymbol{C})^{2}-\boldsymbol{B}^{2} \boldsymbol{C}^{2} \tag{42.10} \end{equation*}(42.10)4A2=(BC)2B2C2
The quantity A A AAA is real when the deficit angle δ δ delta\deltaδ is real. Thus the geometrically important product A δ A δ A deltaA \deltaAδ is also real.
The choice of edge lengths:
(1) choose some timelike, others spacelike

(2) choose timelike lengths comparable to spacelike lengths
(3) why some lengths must be chosen arbitrarily
Deficit angles in terms of edge lengths
Past applications of Regge calculus
When the hinge lies in the ( x , y ) ( x , y ) (x,y)(x, y)(x,y) plane, on the other hand, the quantity A A AAA is purely imaginary. In that instance a test vector taken around the cycle of simplexes that swing on this hinge has undergone change only in its z z zzz and t t ttt components; that is, it has experienced a Lorentz boost; that is, the deficit angle δ δ delta\deltaδ is also purely imaginary. So again the product A δ A δ A deltaA \deltaAδ is a purely real quantity.
Turn now from character of edge lengths to magnitude of edge lengths. It is desirable that the elementary building blocks sample the curvatures of space in different directions on a roughly equal basis. In other words, it is desirable not to have long needle-shaped building blocks nor pancake-shaped tetrahedrons and simplexes. This natural requirement means that the step forward in time should be comparable to the steps "sidewise" in space. The very fact that one should have to state such a requirement brings out one circumstance that should have been obvious before: the "hinge equations"
(42.7) hinges thet haveedgep in common δ h cotan θ p h = 0 ( p = 1 , 2 , ) , (42.7)  hinges   thet   haveedgep   in common  δ h cotan θ p h = 0 ( p = 1 , 2 , ) , {:(42.7)sum_({:[" hinges "" thet "],[" haveedgep "],[" in common "]:})delta_(h)cotantheta_(ph)=0quad(p=1","2","dots)",":}\begin{equation*} \sum_{\substack{\text { hinges } \text { thet } \\ \text { haveedgep } \\ \text { in common }}} \delta_{h} \operatorname{cotan} \theta_{p h}=0 \quad(p=1,2, \ldots), \tag{42.7} \end{equation*}(42.7) hinges  thet  haveedgep  in common δhcotanθph=0(p=1,2,),
though they are as numerous as the edges, cannot be regarded as adequate to determine all edge lengths. There are necessarily relations between these equations that keep them from being independent. The equations cannot determine all the details of the necessarily largely arbitrary skeletonization process. They cannot do so any more than the field equations of general relativity can determine the coordinate system. With a given pattern of vertices (four-dimensional generalization of drawings in Box 42.3), one still has (a) the option how close together one will take successive layers of the structure and (b) how one will distribute a given number of points in space on a given layer to achieve the maximum payoff in accuracy (greater density of points in regions of greater curvature). To prepare a practical computer program founded on Regge calculus, one has to supply the machine not only with the hinge equations and initial conditions, but also with definite algorithms to remove all the arbitrariness that resides in options (a) and (b).
Formulas from solid geometry and four-dimensional geometry, out of which to determine the necessary hyperdihedral angles α α alpha\alphaα and the deficit angles δ δ delta\deltaδ in terms of edge lengths and nothing but edge lengths, are summarized by Wheeler (1964a, pp. 469,470 , and 490 ) and by C. Y. Wong (1971). Regge (1961) also gives a formula for the Riemann curvature tensor itself in terms of deficit angles and number of edges running in a given direction [see also Wheeler (1964a, p. 471)].

§42.7. PAST APPLICATIONS OF REGGE CALCULUS

Wong (1971) has applied Regge calculus to a problem where no time development shows itself, where the geometry can therefore be treated as static, and where in addition it is spherically symmetric. He determined the Schwarzschild and ReissnerNordstrøm geometries by the method of skeletonization. Consider successive spheres
surrounding the center of attraction. Wong approximates each as an icosahedron. The condition
( 3 ) R = 16 π ( energy density on the 3-space ) ( 3 ) R = 16 π (  energy density   on the 3-space  ) ^((3))R=16 pi((" energy density ")/(" on the 3-space ")){ }^{(3)} R=16 \pi\binom{\text { energy density }}{\text { on the 3-space }}(3)R=16π( energy density  on the 3-space )
( $ 21.5 $ 21.5 $21.5\$ 21.5$21.5 ) gives a recursion relation that determines the dimension of each icosahedron in terms of the two preceding icosahedra. Errors in the skeleton representation of the exact geometry range from roughly 10 percent to less than 1 percent, depending on the method of analysis, the quantity under analysis, and the fineness of the subdivision.
Skeletonization of geometry is to be distinguished from mere rewriting of partial differential equations as difference equations. One has by now three illustrations that one can capitalize on skeletonization without fragmenting spacetime all the way to the level of individual simplexes. The first illustration is the first part of Wong's work, where the time dimension never explicitly makes an appearance, so that the building blocks are three-dimensional only. The second is an alternative treatment, also given by Wong, that goes beyond the symmetry in t t ttt to take account of the symmetry in θ θ theta\thetaθ and ϕ ϕ phi\phiϕ. It divides space into spherical shells, in each of which the geometry is "pseudo-flat" in much the same sense that the geometry of a paper cone is flat. The third is the numerical solution for the gravitational collapse of a spherical star by May and White (1966), in which there is symmetry in θ θ theta\thetaθ and ϕ ϕ phi\phiϕ, but not in r r rrr or t t ttt. This zoning takes place exclusively in the r , t r , t r,tr, tr,t-plane. Each zone is a spherical shell. The difference as compared to Regge calculus (flat geometry within each building block) is the adjustable "conicity" given to each shell. The examples show that the decision about skeletonizing the geometry in a calculation is ordinarily not "whether" but "how much."

§42.8. THE FUTURE OF REGGE CALCULUS

In summary, Regge's skeleton calculus puts within the reach of computation problems that in practical terms are beyond the power of normal analytical methods. It affords any desired level of accuracy by sufficiently fine subdivision of the spacetime region under consideration. By way of its numbered building blocks, it also offers a practical way to display the results of such calculations. Finally, one can hope that Regge's truly geometric way of formulating general relativity will someday make the content of the Einstein field equations (Cartan's "moment of rotation"; see Chapter 15) stand out sharp and clear, and unveil the geometric significance of the so-called "geometrodynamic field momentum" (analysis of the boundary-value problem associated with the variational problem of general relativity in Regge calculus; see §21.12).

cmuma 43

SUPERSPACE: ARENA FOR THE DYNAMICS OF GEOMETRY

Traveler, there are no paths.
Paths are made by walking.
ANTONIO MACHADO (1940)
This chapter is entirely Track 2. Chapter 21 (initial-value formalism) is needed as preparation for it. In reading it, one will be helped by Chapter 42 (Regge calculus). It is not needed as preparation for any later chapter, but it will be helpful in Chapter 44 (vision of the future).
Superspace is the arena for geometrodynamics

§43.1. SPACE, SUPERSPACE, AND SPACETIME DISTINGUISHED

Superspace [Wheeler (1964a), pp. 459 ff ] is the arena of geometrodynamics. The dynamics of Einstein's curved space geometry runs its course in superspace as the dynamics of a particle unfolds in spacetime. This chapter gives one simple version of superspace, and a little impression of alternative versions of superspace that also have their uses. It describes the classical dynamics of geometry in superspace in terms of the Hamilton-Jacobi principle of Boxes 25.3 and 25.4. No version of mechanics makes any shorter the leap from classical dynamics to quantum. Thus it provides a principle ("Einstein-Hamilton-Jacobi or EHJ equation") for the propagation of wave crests in superspace, and for finding where those wave crests give one the classical equivalent of constructive interference ("envelope formation"). In this way one finds the track of development of 3-geometry with time expressed as a sharp, thin "leaf of history" that slices through superspace. The quantum principle replaces this deterministic account with a fuzzed-out leaf of history of finite thickness. In consequence, quantum fluctuations take place in the geometry of space that dominate the scene at distances of the order of the Planck length, L = ( G / c 3 ) 1 / 2 = 1.6 × L = G / c 3 1 / 2 = 1.6 × L^(**)=(ℏG//c^(3))^(1//2)=1.6 xxL^{*}=\left(\hbar G / c^{3}\right)^{1 / 2}=1.6 \timesL=(G/c3)1/2=1.6× 10 33 cm 10 33 cm 10^(-33)cm10^{-33} \mathrm{~cm}1033 cm, and less. The present survey simplifies by considering only the dynamics of curved empty space. When sources are present and are to be taken into account, supplementary terms are to be added, some of the literature on which is listed.
In all the difficult investigations that led in the course of half a century to some understanding of the dynamics of geometry, both classical and quantum, the most
Box 43.1 GEOMETRODYNAMICS COMPARED WITH PARTICLE DYNAMICS
Concept Particle dynamics Geometrodynamics
Dynamic entity Particle Space
Descriptors of momentary configuration x , t x , t x,tx, tx,t ("event") (3) % % %\%% ("3-geometry")
Classical history x = x ( t ) x = x ( t ) x=x(t)x=x(t)x=x(t) (4) % % %\%% ("4-geometry")
History is a stockpile of configurations? Yes. Every point on world line gives a momentary configuration of particle Yes. Every spacelike slice through ( 4 ) ( 4 ) ^((4)){ }^{(4)}(4) ) gives a momentary configuration of space
Dynamic arena Spacetime (totality of all points x , t x , t x,tx, tx,t ) Superspace (totality of all (3) E's' ^(')^{\prime}
Concept Particle dynamics Geometrodynamics Dynamic entity Particle Space Descriptors of momentary configuration x,t ("event") (3) % ("3-geometry") Classical history x=x(t) (4) % ("4-geometry") History is a stockpile of configurations? Yes. Every point on world line gives a momentary configuration of particle Yes. Every spacelike slice through ^((4)) ) gives a momentary configuration of space Dynamic arena Spacetime (totality of all points x,t ) Superspace (totality of all (3) E's' ^(')| Concept | Particle dynamics | Geometrodynamics | | :---: | :---: | :---: | | Dynamic entity | Particle | Space | | Descriptors of momentary configuration | $x, t$ ("event") | (3) $\%$ ("3-geometry") | | Classical history | $x=x(t)$ | (4) $\%$ ("4-geometry") | | History is a stockpile of configurations? | Yes. Every point on world line gives a momentary configuration of particle | Yes. Every spacelike slice through ${ }^{(4)}$ ) gives a momentary configuration of space | | Dynamic arena | Spacetime (totality of all points $x, t$ ) | Superspace (totality of all (3) E's' $^{\prime}$ |
difficult point was also the simplest: The dynamic object is not spacetime. It is space. The geometric configuration of space changes with time. But it is space, three-dimensional space, that does the changing (see Box 43.1).
Space will be treated here as "closed" or, in mathematical language, "compact," either because physics adds to Einstein's second-order differential equations the requirement of closure as a necessary and appropriate boundary condition [Einstein (1934, p. 52; 1950); Wheeler (1959; 1964c). Hönl (1962); see also §21.12] or because that requirement simplifies the mathematical analysis, or for both reasons together.
One can approximate a smooth, closed 3-geometry by a skeleton 3-geometry built out of tetrahedrons, as indicated schematically in Figure 43.1 (see Chapter 42 on the Regge calculus). Specify the 98 edge-lengths in this example to fix all the features of the geometry; and fix these 98 edge-lengths by giving the location of a single point in a space of 98 dimensions. This 98 -dimensional manifold, this "truncated superspace," goes over into superspace [Wheeler (1964a), pp. 453, 459, 463, 495] in the idealization in which the tracer points increase in density of coverage without limit. Accounts of superspace with more mathematical detail are given by DeWitt (1967a,b), Wheeler (1970), and Fischer (1970).
Let the representative point move from one location to a nearby location, either in truncated superspace or in full superspace. Then all edge-lengths alter, and the 3-geometry of Figure 43.1 moves as if alive. No better illustration can one easily supply of what it means to speak of the "dynamics of space."
The term "3-geometry" makes sense as well in quantum geometrodynamics as in classical theory. So does superspace. But spacetime does not. Give a 3-geometry, and give its time rate of change. That is enough, under typical circumstances (see Chapter 21) to fix the whole time-evolution of the geometry; enough in other words, to determine the entire four-dimensional spacetime geometry, provided one is
3 -geometry is the dynamic object
Finite-dimensional "truncated superspace"
Figure 43.1.
Superspace in the simplicial approximation. Upper left, space (depicted as two-dimensional but actually three-dimensional). Upper right, simplical approximation to space. The approximation can be made arbitrarily good by going to the limit of an arbitrarily fine decomposition. The curvature at a typical location is measured by a deficit angle. This angle is completely determined by the edge lengths ( L 1 L 1 L_(1)L_{1}L1, L 2 , L 8 L 2 , L 8 L_(2),dotsL_(8)L_{2}, \ldots L_{8}L2,L8 in the figure) of the simplexes that meet at that location. When there are 98 edge lengths altogether in the simplicial representation of the geometry, then this geometry is completely specified by a single point in a 98 -dimensional space (lower diagram; "superspace").
The concept of spacetime is incompatible with the quantum principle
considering the problem in the context of classical physics. In the real world of quantum physics, however, one cannot give both a dynamic variable and its time-rate of change. The principle of complementarity forbids. Given the precise 3-geometry at one instant, one cannot also know at that instant the time-rate of change of the 3 -geometry. In other words, given the geometrodynamic field coordinate, one cannot know the geometrodynamic field momentum. If one assigns the intrinsic 3-geometry, one cannot also specify the extrinsic curvature.
The uncertainty principle thus deprives one of any way whatsoever to predict, or even to give meaning to, "the deterministic classical history of space evolving
in time." No prediction of spacetime, therefore no meaning for spacetime, is the verdict of the quantum principle. That object which is central to all of classical general relativity, the four-dimensional spacetime geometry, simply does not exist, except in a classical approximation.
These considerations reveal that the concepts of spacetime and time are not primary but secondary ideas in the structure of physical theory. These concepts are valid in the classical approximation. However, they have neither meaning nor application under circumstances where quantum geometrodynamic effects become important. Then one has to forego that view of nature in which every event, past, present, or future, occupies its preordained position in a grand catalog called "spacetime," with the Einstein interval from each event to its neighbor eternally established. There is no spacetime, there is no time, there is no before, there is no after. The question of what happens "next" is without meaning.
That spacetime is not the right way does not mean there is no right way to describe the dynamics of geometry consistent with the quantum principle. Superspace is the key to one right way to describe the dynamics (see Figure 43.2).
Figure 43.2.
Space, spacetime, and superspace. Upper left: Five sample configurations, A , B , C , D , E A , B , C , D , E A,B,C,D,EA, B, C, D, EA,B,C,D,E, attained by space in the course of its expansion and recontraction. Below: Superspace and these five sample configurations, each represented by a point in superspace. Upper right: Spacetime. A spacelike cut, like A A AAA, through spacetime gives a momentary configuration of space. There is no compulsion to limit attention to a one-parameter family of slices, A , B , C , D , E A , B , C , D , E A,B,C,D,EA, B, C, D, EA,B,C,D,E through spacetime. The phrase "many-fingered time" is a slogan telling one not to so limit one's slices, and B B B^(')B^{\prime}B is an example of this freedom in action. The 3-geometries B B B^(')B^{\prime}B and A , B , C , D , E A , B , C , D , E A,B,C,D,EA, B, C, D, EA,B,C,D,E, like all 3-geometries obtained by all spacelike slices whatsoever through the given classical spacetime, lie on a single bent leaf of history, indicated in the diagram, and cutting its thin slice through superspace. A different spacetime, in other words, a different solution of Einstein's field equation, means a different leaf of history (not indicated) slicing through superspace.

§43.2. THE DYNAMICS OF GEOMETRY DESCRIBED IN THE LANGUAGE OF THE SUPERSPACE OF THE ( 3 ) y ( 3 ) y ^((3))y^('){ }^{(3)} y^{\prime}(3)y 's

Spacetime is a classical leaf of history slicing through superspace
Given a spacetime, one can construct the corresponding leaf of history slicing through superspace. Conversely, given the leaf of history, one can reconstruct the spacetime.
Consider the child's toy commonly known as "Chinese boxes." One opens the outermost box only to reveal another box; when the second box is opened, there is another box, and so on, until eventually there are dozens of boxes scattered over the floor. Or conversely the boxes can be put back together, nested one inside the other, to reconstitute the original package. The packaging of ( 3 ) y ( 3 ) y ^((3))y^('){ }^{(3)} \mathscr{y}^{\prime}(3)y s into a ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} \mathscr{y}(4)y is much more sophisticated. Nature provides no monotonic ordering of the ( 3 ) y ( 3 ) y ^((3))y^('){ }^{(3)} y^{\prime}(3)y 's. Two of the dynamically allowed ( 3 ) ξ ( 3 ) ξ ^((3))xi{ }^{(3)} \xi(3)ξ 's, taken at random, will often cross each other one or more times. When one shakes the ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} y(4)y apart, one therefore gets enormously more ( 3 ) y ( 3 ) y ^((3))y^('){ }^{(3)} y^{\prime}(3)y 's "spread out over the floor" than might have been imagined. Conversely, when one puts back together all of the ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) S's lying on the leaf of history, one gets a structure with a rigidity that might not otherwise have been foreseen. This rigidity arises from the infinitely rich interleaving and intercrossing of clear-cut, well-defined ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) g's one with another.
In summary: (1) Classical geometrodynamics in principle constitutes a device, an algorithm, a rule for calculating and constructing a leaf of history that slices through superspace. (2) The ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) 'g's that lie on this leaf of history are YES 3-geometries; the vastly more numerous ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y 's that do not are NO 3-geometries. (3) The YES ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} y(3)y 's are the building blocks of the ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} y(4)y that is classical spacetime. (4) The interweaving and interconnections of these building blocks give the ( 4 ) y ( 4 ) y ^((4))y{ }^{(4)} \mathscr{y}(4)y its existence, its dimensionality, and its structure. (5) In this structure every ( 3 ) y ( 3 ) y ^((3))y{ }^{(3)} \mathscr{y}(3)y has a rigidly fixed location of its own. (6) In this sense one can say that the "many-fingered time" of each 3-geometry is specified by the very interlocking construction itself. Baierlein, Sharp and Wheeler (1962) say a little more on this concept of "3-geometry as carrier of information about time."
How different from the textbook concept of spacetime! There the geometry of spacetime is conceived as constructed out of elementary objects, or points, known as "events." Here, by contrast, the primary concept is 3-geometry, in abstracto, and out of it is derived the idea of event. Thus, (1) the event lies at the intersection of such and such ( 3 ) Θ ( 3 ) Θ ^((3))Theta^('){ }^{(3)} \Theta^{\prime}(3)Θ 's; and (2) it has a timelike relation to (earlier or later than, or synchronous with) some other ( 3 ) ( 3 ) ^((3))ℓ{ }^{(3)} \ell(3), which in turn (3) derives from the intercrossings of all the other ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) g's.
When one turns from classical theory to quantum theory, one gives up the concept of spacetime, except in the semiclassical approximation. Therefore, one gives up any immediate possibility whatsoever of defining the concept, normally regarded as so elemental, of an "event." The theory itself, however, here as always [Bohr and Rosenfeld (1933)], defines in and by itself, in its own natural way, the procedures-in-principle for measuring all those quantities that are in principle measurable.
Quantum theory upsets the sharp distinction between YES 3-geometries and NO
3-geometries. It assigns to each 3-geometry not a YES or a NO, but a probability amplitude,
(43.1) ψ = ψ ( ( 3 ) ξ ) (43.1) ψ = ψ ( 3 ) ξ {:(43.1)psi=psi(^((3))xi):}\begin{equation*} \psi=\psi\left({ }^{(3)} \xi\right) \tag{43.1} \end{equation*}(43.1)ψ=ψ((3)ξ)
This probability amplitude is highest near the classically forecast leaf of history and falls off steeply outside a zone of finite thickness extending a little way on either side of the leaf.
Were one to take, instead of a physically relevant probability amplitude function, a typical solution of the relevant wave equation, one would have to expect to see not one trace of anything like classical geometrodynamics. The typical probability amplitude function is spread all over superspace. No surprise! Already in classical theory one has to reckon with a Hamilton-Jacobi function,
(43.2) S = S ( ( 3 ) ξ ) (43.2) S = S ( 3 ) ξ {:(43.2)S=S(^((3))xi):}\begin{equation*} S=S\left({ }^{(3)} \xi\right) \tag{43.2} \end{equation*}(43.2)S=S((3)ξ)
spread out over superspace. Moreover, this "dynamic phase function" of classical geometrodynamics gives at once the phase of ψ ψ psi\psiψ, according to the formula
(43.3) ψ ( ( 3 ) y ) = ( slowly varying amplitude function ) e ( i / ) S ( ( 3 ) ) (43.3) ψ ( 3 ) y = (  slowly varying   amplitude function  ) e ( i / ) S ( 3 ) {:(43.3)psi(^((3))y)=((" slowly varying ")/(" amplitude function "))e^((i//ℏ)S(^((3))ℓ)):}\begin{equation*} \psi\left({ }^{(3)} y\right)=\binom{\text { slowly varying }}{\text { amplitude function }} e^{(i / \hbar) S\left(^{(3)} \ell\right)} \tag{43.3} \end{equation*}(43.3)ψ((3)y)=( slowly varying  amplitude function )e(i/)S((3))
indication enough that ψ ψ psi\psiψ and S S SSS are both unlocalized.
Dynamics first clearly becomes recognizable when sufficiently many such spreadout probability amplitude functions are superposed to build up a localized wave packet, as in the elementary examples of Boxes 25.3 and 25.4 ; thus,
(43.4) ψ = c 1 ψ 1 + c 2 ψ 2 + (43.4) ψ = c 1 ψ 1 + c 2 ψ 2 + {:(43.4)psi=c_(1)psi_(1)+c_(2)psi_(2)+cdots:}\begin{equation*} \psi=c_{1} \psi_{1}+c_{2} \psi_{2}+\cdots \tag{43.4} \end{equation*}(43.4)ψ=c1ψ1+c2ψ2+
Constructive interference occurs where the phases of the several individual waves agree:
(43.5) S 1 ( ( 3 ) ξ ) = S 2 ( ( 3 ) ξ ) = (43.5) S 1 ( 3 ) ξ = S 2 ( 3 ) ξ = {:(43.5)S_(1)(^((3))xi)=S_(2)(^((3))xi)=cdots:}\begin{equation*} S_{1}\left({ }^{(3)} \xi\right)=S_{2}\left({ }^{(3)} \xi\right)=\cdots \tag{43.5} \end{equation*}(43.5)S1((3)ξ)=S2((3)ξ)=
This is the condition that distinguishes YES 3-geometries from NO 3-geometries. It is the tool for constructing a leaf of history in superspace. It is the key to the dynamics of geometry. Moreover, it is an equation that says not one word about the quantum principle. It is not surprising that the equation of constructive interference in (43.5) makes the leap from classical theory to quantum theory the shortest.

§43.3. THE EINSTEIN-HAMILTON-JACOBI EQUATION

Should one write down a differential equation for the Hamilton-Jacobi function S ( ( 3 ) ϑ ) S ( 3 ) ϑ S(^((3))vartheta)S\left({ }^{(3)} \vartheta\right)S((3)ϑ), solve it, and then analyze the properties of the solution? The exact opposite is simpler: look at the properties of the solution, and from that inspection find out what equation the dynamic phase or action S S SSS must satisfy.
Probability amplitude for a 3-geometry
Wave packet recovers classical geometrodynamics
Hilberts' principle of least action reads
(43.6) I Hilbert = ( 1 / 16 π ) ( 4 ) R ( g ) 1 / 2 d 4 x = extremum. (43.6) I Hilbert  = ( 1 / 16 π ) ( 4 ) R ( g ) 1 / 2 d 4 x =  extremum.  {:(43.6)I_("Hilbert ")=(1//16 pi)int^((4))R(-g)^(1//2)d^(4)x=" extremum. ":}\begin{equation*} I_{\text {Hilbert }}=(1 / 16 \pi) \int{ }^{(4)} R(-g)^{1 / 2} d^{4} x=\text { extremum. } \tag{43.6} \end{equation*}(43.6)IHilbert =(1/16π)(4)R(g)1/2d4x= extremum. 
After one separates off complete derivatives in the integrand, what is left [see equations (21.13) and (21.95)] becomes
( 1 / 16 π ) I ADM = I true = ( 1 / 16 π ) { π i j g i j / t + N g 1 / 2 R (43.7) + N g 1 / 2 [ 1 2 ( Tr π ) 2 Tr ( Π 2 ) ] + 2 N i π i j j } d 4 x ( 1 / 16 π ) I ADM = I true  = ( 1 / 16 π ) π i j g i j / t + N g 1 / 2 R (43.7) + N g 1 / 2 1 2 ( Tr π ) 2 Tr Π 2 + 2 N i π i j j d 4 x {:[(1//16 pi)I_(ADM)=I_("true ")=(1//16 pi)int{pi^(ij)delg_(ij)//del t+Ng^(1//2)R:}],[(43.7){:+Ng^(-1//2)[(1)/(2)(Tr pi)^(2)-Tr(Pi^(2))]+2N_(i)pi^(ij)_(∣j)}d^(4)x]:}\begin{align*} (1 / 16 \pi) I_{\mathrm{ADM}}= & I_{\text {true }}=(1 / 16 \pi) \int\left\{\pi^{i j} \partial g_{i j} / \partial t+N g^{1 / 2} R\right. \\ & \left.+N g^{-1 / 2}\left[\frac{1}{2}(\operatorname{Tr} \boldsymbol{\pi})^{2}-\operatorname{Tr}\left(\boldsymbol{\Pi}^{2}\right)\right]+2 N_{i} \pi^{i j}{ }_{\mid j}\right\} d^{4} x \tag{43.7} \end{align*}(1/16π)IADM=Itrue =(1/16π){πijgij/t+Ng1/2R(43.7)+Ng1/2[12(Trπ)2Tr(Π2)]+2Niπijj}d4x
In (43.7), but not in (43.6), g g ggg stands for the determinant of the three-dimensional metric tensor, g i j g i j g_(ij)g_{i j}gij, and R R RRR for the scalar curvature invariant of the 3-geometry; the suffix ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) is omitted for simplicity. The integral is extended from (1) a spacelike hypersurface on which a 3-geometry is given with metric g i j ( x , y , z ) g i j ( x , y , z ) g_(ij)^(')(x,y,z)g_{i j}{ }^{\prime}(x, y, z)gij(x,y,z) to (2) a spacelike hypersurface on which a 3 -geometry is given with metric g i j ( x , y , z ) g i j ( x , y , z ) g_(ij)^('')(x,y,z)g_{i j}{ }^{\prime \prime}(x, y, z)gij(x,y,z). Whatever is adjustable in the chunk of spacetime between is now to be considered as having been adjusted to extremize the integral. Therefore the value of the integral I ADM I ADM I_(ADM)I_{\mathrm{ADM}}IADM becomes a functional of the metrics on the two hypersurfaces and nothing more.
Next, holding fixed the metric g i j ( x , y , z ) g i j ( x , y , z ) g_(ij)^(')(x,y,z)g_{i j}^{\prime}(x, y, z)gij(x,y,z) on the earlier hypersurface, change slightly or even more than slightly the metric on the later hypersurface. Solve the new variation problem and get a new value of I ADM I ADM I_(ADM)I_{\mathrm{ADM}}IADM. Proceeding further in this way, for each new g i j g i j g_(ij)^('')g_{i j}{ }^{\prime \prime}gij one gets a new value of I ADM I ADM I_(ADM)I_{\mathrm{ADM}}IADM. Call the functional I ADM I ADM I_(ADM)I_{\mathrm{ADM}}IADM of the metric defined in this way "Hamilton's principal function," or the "action" or the "dynamic path length,*" S ( g i j ( x , y , z ) ) S g i j ( x , y , z ) S(g_(ij)(x,y,z))S\left(g_{i j}(x, y, z)\right)S(gij(x,y,z)) of the "history of geometry" that connects the two given 3-geometries. The double prime suffix is dropped from g i j g i j g_(ij)^('')g_{i j}{ }^{\prime \prime}gij here and hereafter to simplify the notation. One knows from other branches of mechanics that the quantity defined in this way, S ( g i j ) S g i j S(g_(ij))S\left(g_{i j}\right)S(gij), when it exists, even though it is a special solution, nevertheless is always a solution of the Hamilton-Jacobi equation. Jacobi could look for more general solutions, but Hamilton already had one!
For (43.7) to be an extremal with respect to variations of the lapse N N NNN and the shift components N i N i N_(i)N_{i}Ni, it was necessary (see Chapter 21) that the coefficients of these four quantities should vanish; thus,
(43.8) g 1 / 2 [ 1 2 ( Tr π ) 2 Tr π 2 ] + g 1 / 2 R = 0 (43.8) g 1 / 2 1 2 ( Tr π ) 2 Tr π 2 + g 1 / 2 R = 0 {:(43.8)g^(-1//2)[(1)/(2)(Tr pi)^(2)-Trpi^(2)]+g^(1//2)R=0:}\begin{equation*} g^{-1 / 2}\left[\frac{1}{2}(\operatorname{Tr} \boldsymbol{\pi})^{2}-\operatorname{Tr} \boldsymbol{\pi}^{2}\right]+g^{1 / 2} R=0 \tag{43.8} \end{equation*}(43.8)g1/2[12(Trπ)2Trπ2]+g1/2R=0
and
(43.9) π i j 1 j = 0 . (43.9) π i j 1 j = 0 . {:(43.9)pi^(ij)_(1j)=0.:}\begin{equation*} \pi^{i j}{ }_{1 j}=0 . \tag{43.9} \end{equation*}(43.9)πij1j=0.
In the expression for the extremal value of the action, only one term, the first, is left:
(43.10) S ( g ( x , y , z ) ) = I ADM , extremal = g i j g i j { π i j g i j / t } d 4 x (43.10) S ( g ( x , y , z ) ) = I ADM ,  extremal  = g i j g i j π i j g i j / t d 4 x {:(43.10)S(g(x","y","z))=I_(ADM," extremal ")=int_(g_(ij)^('))^(g_(ij)){pi^(ij)delg_(ij)//del t}d^(4)x:}\begin{equation*} S(g(x, y, z))=I_{\mathrm{ADM}, \text { extremal }}=\int_{g_{i j}^{\prime}}^{g_{i j}}\left\{\pi^{i j} \partial g_{i j} / \partial t\right\} d^{4} x \tag{43.10} \end{equation*}(43.10)S(g(x,y,z))=IADM, extremal =gijgij{πijgij/t}d4x
The effect of a slight change, δ g i j δ g i j deltag_(ij)\delta g_{i j}δgij, in the 3-metric at the upper limit is therefore easy to read off:
(43.11) δ S = π i j ( x , y , z ) δ g i j ( x , y , z ) d 3 x (43.11) δ S = π i j ( x , y , z ) δ g i j ( x , y , z ) d 3 x {:(43.11)delta S=intpi^(ij)(x","y","z)deltag_(ij)(x","y","z)d^(3)x:}\begin{equation*} \delta S=\int \pi^{i j}(x, y, z) \delta g_{i j}(x, y, z) d^{3} x \tag{43.11} \end{equation*}(43.11)δS=πij(x,y,z)δgij(x,y,z)d3x
The language of "functional derivative" [see, for example, Bogoliubov and Shirkov (1959)] allows one to speak in terms of a derivative rather than an integral:
(43.12) δ S δ g i j = π i j (43.12) δ S δ g i j = π i j {:(43.12)(delta S)/(deltag_(ij))=pi^(ij):}\begin{equation*} \frac{\delta S}{\delta g_{i j}}=\pi^{i j} \tag{43.12} \end{equation*}(43.12)δSδgij=πij
The "field momenta" acquire a simple meaning: they give the rate of change of the action with respect to the continuous infinitude of "field coordinates," g i j ( x , y , z ) g i j ( x , y , z ) g_(ij)(x,y,z)g_{i j}(x, y, z)gij(x,y,z). (Here the x , y , z x , y , z x,y,zx, y, zx,y,z, as well as the i i iii and j j jjj, serve as mere labels.)
Although the phase function S S SSS appears to depend on all six metric coefficients g i j g i j g_(ij)g_{i j}gij individually, it depends in actuality only on that combination of the g i j g i j g_(ij)g_{i j}gij which is locked to the 3 -geometry. To verify this point, express a particular 3-geometry ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) ) throughout one local coordinate patch in terms of one set of coordinates x p x p x^(p)x^{p}xp by one set of metric coefficients g p q g p q g_(pq)g_{p q}gpq. Reexpress the same 3 -geometry in terms of coordinates x ¯ p x ¯ p bar(x)^(p)\bar{x}^{p}x¯p shifted by the small amount ξ p ξ p xi^(p)\xi^{p}ξp,
(43.13) x ¯ p = x p ξ p . (43.13) x ¯ p = x p ξ p . {:(43.13) bar(x)^(p)=x^(p)-xi^(p).:}\begin{equation*} \bar{x}^{p}=x^{p}-\xi^{p} . \tag{43.13} \end{equation*}(43.13)x¯p=xpξp.
To keep the 3-geometry the same, that is, to keep unchanged the distance d s d s dsd sds from one coordinate-independent point to another, the metric coefficients have to change:
(43.14) g ¯ p q = g p q + ξ p q + ξ q p (43.14) g ¯ p q = g p q + ξ p q + ξ q p {:(43.14) bar(g)_(pq)=g_(pq)+xi_(p∣q)+xi_(q∣p):}\begin{equation*} \bar{g}_{p q}=g_{p q}+\xi_{p \mid q}+\xi_{q \mid p} \tag{43.14} \end{equation*}(43.14)g¯pq=gpq+ξpq+ξqp
Let the phase function S S SSS (or in quantum mechanics, let the probability amplitude ψ ) ψ ) psi)\psi)ψ) be considered to be expressed as a functional of the metric coefficients g 11 ( x ) g 11 ( x ) g_(11)(x)g_{11}(x)g11(x), g 12 ( x ) , , g 33 ( x ) g 12 ( x ) , , g 33 ( x ) g_(12)(x),dots,g_(33)(x)g_{12}(x), \ldots, g_{33}(x)g12(x),,g33(x). Changes δ g p q ( x ) δ g p q ( x ) deltag_(pq)(x)\delta g_{p q}(x)δgpq(x) in these coefficients alter the H-J phase function and the probability amplitude by the amounts
δ S = ( δ S / δ g p q ) δ g p q d 3 x (43.15) δ ψ = ( δ ψ / δ g p q ) δ g p q d 3 x δ S = δ S / δ g p q δ g p q d 3 x (43.15) δ ψ = δ ψ / δ g p q δ g p q d 3 x {:[delta S=int(delta S//deltag_(pq))deltag_(pq)d^(3)x],[(43.15)delta psi=int(delta psi//deltag_(pq))deltag_(pq)d^(3)x]:}\begin{align*} \delta S & =\int\left(\delta S / \delta g_{p q}\right) \delta g_{p q} d^{3} x \\ \delta \psi & =\int\left(\delta \psi / \delta g_{p q}\right) \delta g_{p q} d^{3} x \tag{43.15} \end{align*}δS=(δS/δgpq)δgpqd3x(43.15)δψ=(δψ/δgpq)δgpqd3x
according to the standard definition of functional derivative. Therefore the coordinate change produces an ostensible change in the dynamic path length or phase S S SSS given by
(43.16) δ S = ( δ S / δ g p q ) ( ξ p q + ξ q p ) d 3 x = 2 ( δ S / δ g p q ) q ξ p d 3 x . (43.16) δ S = δ S / δ g p q ξ p q + ξ q p d 3 x = 2 δ S / δ g p q q ξ p d 3 x . {:[(43.16)delta S=int(delta S//deltag_(pq))(xi_(p∣q)+xi_(q∣p))d^(3)x],[=-2int(delta S//deltag_(pq))_(∣q)xi_(p)d^(3)x.]:}\begin{align*} \delta S & =\int\left(\delta S / \delta g_{p q}\right)\left(\xi_{p \mid q}+\xi_{q \mid p}\right) d^{3} x \tag{43.16}\\ & =-2 \int\left(\delta S / \delta g_{p q}\right)_{\mid q} \xi_{p} d^{3} x . \end{align*}(43.16)δS=(δS/δgpq)(ξpq+ξqp)d3x=2(δS/δgpq)qξpd3x.
This change must vanish if S S SSS is to depend on the 3-geometry alone, and not on
the coordinates in terms of which that 3-geometry is expressed; and must vanish, moreover, for arbitrary choice of the ξ p ξ p xi_(p)\xi_{p}ξp. From this condition, one concludes
(43.17) ( δ S δ g p q ) q = 0 . (43.17) δ S δ g p q q = 0 . {:(43.17)((delta S)/(deltag_(pq)))_(∣q)=0.:}\begin{equation*} \left(\frac{\delta S}{\delta g_{p q}}\right)_{\mid q}=0 . \tag{43.17} \end{equation*}(43.17)(δSδgpq)q=0.
Likewise, one finds three equations on the wave function ψ ψ psi\psiψ itself, as distinguished from its phase S / S / S//ℏS / \hbarS/; thus,
(43.18) ( δ ψ δ g p q ) q = 0 . (43.18) δ ψ δ g p q q = 0 . {:(43.18)((delta psi)/(deltag_(pq)))_(∣q)=0.:}\begin{equation*} \left(\frac{\delta \psi}{\delta g_{p q}}\right)_{\mid q}=0 . \tag{43.18} \end{equation*}(43.18)(δψδgpq)q=0.
But (43.17), by virtue of (43.12), is identical with (43.9). In this sense (43.9) merely verifies what one already knew had to be true: the classical Hamilton-Jacobi function S S SSS (like the probability amplitude function ψ ψ psi\psiψ of quantum theory) depends on 3-geometry, not on individual metric coefficients, and not on choice of coordinates.
All the dynamic content of geometrodynamics is summarized in the sole remaining equation (43.8), which takes the form
(43.19) g 1 / 2 [ 1 2 g p q g r s g p r g q s ] δ S δ g p q δ S δ g r s + g 1 / 2 R = 0 (43.19) g 1 / 2 1 2 g p q g r s g p r g q s δ S δ g p q δ S δ g r s + g 1 / 2 R = 0 {:(43.19)g^(-1//2)[(1)/(2)g_(pq)g_(rs)-g_(pr)g_(qs)](delta S)/(deltag_(pq))(delta S)/(deltag_(rs))+g^(1//2)R=0:}\begin{equation*} g^{-1 / 2}\left[\frac{1}{2} g_{p q} g_{r s}-g_{p r} g_{q s}\right] \frac{\delta S}{\delta g_{p q}} \frac{\delta S}{\delta g_{r s}}+g^{1 / 2} R=0 \tag{43.19} \end{equation*}(43.19)g1/2[12gpqgrsgprgqs]δSδgpqδSδgrs+g1/2R=0
This is the Einstein-Hamilton-Jacobi equation, first given explicitly in the literature by Peres (1962) on the foundation of earlier work by himself and others on the Hamiltonian formulation of geometrodynamics. This equation tells how fronts of constant S S SSS ("wave crests") propagate in superspace.
That the one EHJ equation (43.19) contains as much information as all ten components of Einstein's field equation has been demonstrated by Gerlach (1969). The central point in the analysis is the principle of constructive interference, and the main requirement for a proper treatment of this point is the concept of a completely parametrized solution of the EHJ equation.
The problem of a particle moving in two-dimensional space, as treated by the Hamilton-Jacobi method in Boxes 25.3 and 25.4, required for complete analysis a solution containing two distinct and independently adjustable parameters, the energy per unit mass, E ~ E ~ widetilde(E)\widetilde{E}E~, and angular momentum per unit mass, L ~ L ~ widetilde(L)\widetilde{L}L~; thus
S ( r , θ , t ; E ~ , L ~ ) = E ~ t + L ~ θ (43.20) + r [ E ~ 2 ( 1 2 M / r ) ( 1 + L ~ 2 / r 2 ) ] 1 / 2 d r ( 1 2 M / r ) + δ ( E ~ , L ~ ) S ( r , θ , t ; E ~ , L ~ ) = E ~ t + L ~ θ (43.20) + r E ~ 2 ( 1 2 M / r ) 1 + L ~ 2 / r 2 1 / 2 d r ( 1 2 M / r ) + δ ( E ~ , L ~ ) {:[S(r","theta","t; widetilde(E)"," widetilde(L))=- widetilde(E)t+ widetilde(L)theta],[(43.20)+int^(r)[ widetilde(E)^(2)-(1-2M//r)(1+ widetilde(L)^(2)//r^(2))]^(1//2)(dr)/((1-2M//r))+delta( widetilde(E)"," widetilde(L))]:}\begin{align*} & S(r, \theta, t ; \widetilde{E}, \widetilde{L})=-\widetilde{E} t+\widetilde{L} \theta \\ & +\int^{r}\left[\widetilde{E}^{2}-(1-2 M / r)\left(1+\widetilde{L}^{2} / r^{2}\right)\right]^{1 / 2} \frac{d r}{(1-2 M / r)}+\delta(\widetilde{E}, \widetilde{L}) \tag{43.20} \end{align*}S(r,θ,t;E~,L~)=E~t+L~θ(43.20)+r[E~2(12M/r)(1+L~2/r2)]1/2dr(12M/r)+δ(E~,L~)
Here the additive phase δ ( E ~ , L ~ ) δ ( E ~ , L ~ ) delta( widetilde(E), widetilde(L))\delta(\widetilde{E}, \widetilde{L})δ(E~,L~) is required if one is to be able to arrange for the particle to arrive at a given r r rrr-value at a specified t t ttt value and at a specified value of θ θ theta\thetaθ. One thinks of superposing four probability amplitudes, as in (43.4), with dynamic phases, S S SSS, given by ( 43.20 ) ( 43.20 ) (43.20)(43.20)(43.20) and the parameters taking on, respectively, the following four sets of values: ( E ~ , L ~ ) ; ( E ~ + Δ E ~ , L ~ ) ; ( E ~ , L ~ + Δ L ~ ) ( E ~ , L ~ ) ; ( E ~ + Δ E ~ , L ~ ) ; ( E ~ , L ~ + Δ L ~ ) ( widetilde(E), widetilde(L));( widetilde(E)+Delta widetilde(E), widetilde(L));( widetilde(E), widetilde(L)+Delta widetilde(L))(\widetilde{E}, \widetilde{L}) ;(\widetilde{E}+\Delta \widetilde{E}, \widetilde{L}) ;(\widetilde{E}, \widetilde{L}+\Delta \widetilde{L})(E~,L~);(E~+ΔE~,L~);(E~,L~+ΔL~); and ( E ~ + Δ E ~ ( E ~ + Δ E ~ ( widetilde(E)+Delta widetilde(E)(\widetilde{E}+\Delta \widetilde{E}(E~+ΔE~, L ~ + Δ L ~ ) L ~ + Δ L ~ ) widetilde(L)+Delta widetilde(L))\widetilde{L}+\Delta \widetilde{L})L~+ΔL~). The principle of constructive interference leads to the conditions
S / E ~ = 0 (43.21) S / L ~ = 0 . S / E ~ = 0 (43.21) S / L ~ = 0 . {:[del S//del widetilde(E)=0],[(43.21)del S//del widetilde(L)=0.]:}\begin{align*} & \partial S / \partial \widetilde{E}=0 \\ & \partial S / \partial \widetilde{L}=0 . \tag{43.21} \end{align*}S/E~=0(43.21)S/L~=0.
The points in the spacetime ( r , θ , t ) ( r , θ , t ) (r,theta,t)(r, \theta, t)(r,θ,t) that satisfy these conditions are the YES points; they lie on the world line. The ones that don't are the NO points.
The desired solution of the EHJ equation (43.19) contains not two parameters (plus an additive phase, δ δ delta\deltaδ, depending on these two parameters), but an infinity of parameters, and even a continuous infinity of parameters. Thus the parameters are not to be designated as α 1 , α 2 , ; β 1 , β 2 , α 1 , α 2 , ; β 1 , β 2 , alpha_(1),alpha_(2),dots;beta_(1),beta_(2),dots\alpha_{1}, \alpha_{2}, \ldots ; \beta_{1}, \beta_{2}, \ldotsα1,α2,;β1,β2, (parameters labeled by a discrete index), but as
α ( u , v , w ) α ( u , v , w ) alpha(u,v,w)\alpha(u, v, w)α(u,v,w)
and
β ( u , v , w ) β ( u , v , w ) beta(u,v,w)\beta(u, v, w)β(u,v,w)
(two parameters "labeled" by three continuous indices u , v , w u , v , w u,v,wu, v, wu,v,w ). Accidentally omit one of this infinitude of parameters? How could one ever hope to know that what purported to be a complete solution of the EHJ equation was not in actuality complete? Happily Gerlach provides a procedure to test the parameters for completeness.
Granted completeness, Gerlach goes on to show that the "leaf of history in superspace" or collection of 3-geometries that satisfy the conditions of constructive interference,
δ S ( ( 3 ) ξ ; α ( u , v , w ) , β ( u , v , w ) ) δ α = 0 , (43.22) δ S ( ( 3 ) ; ; α ( u , v , w ) , β ( u , v , w ) ) δ β = 0 , δ S ( 3 ) ξ ; α ( u , v , w ) , β ( u , v , w ) δ α = 0 , (43.22) δ S ( 3 ) ; ; α ( u , v , w ) , β ( u , v , w ) δ β = 0 , {:[(delta S(^((3))xi;alpha(u,v,w),beta(u,v,w)))/(delta alpha)=0","],[(43.22)(delta S(^((3));;alpha(u,v,w),beta(u,v,w)))/(delta beta)=0","]:}\begin{align*} & \frac{\delta S\left({ }^{(3)} \xi ; \alpha(u, v, w), \beta(u, v, w)\right)}{\delta \alpha}=0, \\ & \frac{\delta S\left({ }^{(3)} ; ; \alpha(u, v, w), \beta(u, v, w)\right)}{\delta \beta}=0, \tag{43.22} \end{align*}δS((3)ξ;α(u,v,w),β(u,v,w))δα=0,(43.22)δS((3);;α(u,v,w),β(u,v,w))δβ=0,
is identical with the leaf of history, or equivalent 4 -geometry, given by the ten components of Einstein's geometrodynamic law.
From the Hamilton-Jacobi equation for a problem in elementary mechanics, it is a short step to the corresponding Schroedinger equation; similarly in geometrodynamics. No one has done more than Bryce DeWitt to explore the meaning and consequences of this "Einstein-Schroedinger equation" [DeWitt (1967a,b)]. One of the most interesting consequences is the existence of a conserved current in superspace, analogous to the conserved current
j μ = 2 i m ( ψ ψ x μ ψ ψ x μ ) j μ = 2 i m ψ ψ x μ ψ ψ x μ j_(mu)=(ℏ)/(2im)(psi^(**)(del psi)/(delx^(mu))-psi(delpsi^(**))/(delx^(mu)))j_{\mu}=\frac{\hbar}{2 i m}\left(\psi^{*} \frac{\partial \psi}{\partial x^{\mu}}-\psi \frac{\partial \psi^{*}}{\partial x^{\mu}}\right)jμ=2im(ψψxμψψxμ)
that one encounters in the Klein-Gordon wave equation for a particle of spin zero.
It is an unhappy feature of this Einstein-Schroedinger wave equation that it contains second derivatives; so one has to specify both the probability amplitude, and the normal derivative of the probability amplitude, on the appropriate "super-
Condition of constructive interference gives classical "leaf of history" or spacetime
hypersurface" in superspace, in order to be able to predict the evolution of this state function elsewhere in superspace. One suggested way out of this situation-it is at least an inconvenience, possibly a real difficulty-has been proposed by Leutwyler (1968): impose a natural boundary condition that reduces the number of independent solutions from the number characteristic of a second-order equation to the number characteristic of a first-order equation. Another way out is to formulate the dynamics quite differently, in the way proposed by Kuchař (see Chapter 21), in which the resulting equation is of first order in the variable analogous to time.
The exploration of quantum geometrodynamics is simplified when one treats most of the degrees of freedom of the geometry as frozen out, by imposition of a high degree of symmetry. Then one is left with one, two, or three degrees of freedom (see Chapter 30, on mixmaster cosmology), or even infinitely many, and is led to manageable problems of quantum mechanics [Misner (1972a, 1973)].

§43.4. FLUCTUATIONS IN GEOMETRY

Of all the remarkable developments of physics since World War II, none is more impressive than the prediction and verification of the effects of the vacuum fluctuations in the electromagnetic field on the motion of the electron in the hydrogen atom (Figure 43.3). That development made it impossible to overlook the effects of such fluctuations throughout all physics and, not least, in the geometry of spacetime itself.
Figure 43.3.
Symbolic representation of motion of electron in hydrogen atom as affected by fluctuations in electric field in vacuum ("vacuum" or "ground state" or "zero-point" fluctuations). The electric field associated with the fluctuation, E x ( t ) = E x ( ω ) e i ω t d ω E x ( t ) = E x ( ω ) e i ω t d ω E_(x)(t)=intE_(x)(omega)e^(-i omega t)d omegaE_{x}(t)=\int E_{x}(\omega) e^{-i \omega t} d \omegaEx(t)=Ex(ω)eiωtdω, adds to the static electric field provided by the nucleus itself. The additional field brings about in the most elementary approximation the displacement Δ x = ( e / m ω 2 ) E x ( ω ) e i ω t d ω Δ x = e / m ω 2 E x ( ω ) e i ω t d ω Delta x=int(e//momega^(2))E_(x)(omega)e^(-i omega t)d omega\Delta x=\int\left(e / m \omega^{2}\right) E_{x}(\omega) e^{-i \omega t} d \omegaΔx=(e/mω2)Ex(ω)eiωtdω. The average vanishes but the root mean square ( Δ x ) 2 ( Δ x ) 2 (:(Delta x)^(2):)\left\langle(\Delta x)^{2}\right\rangle(Δx)2 does not. In consequence the electron feels an effective atomic potential altered from the expected value V ( x , y , z ) V ( x , y , z ) V(x,y,z)V(x, y, z)V(x,y,z) by the amount
Δ V ( x , y , z ) = 1 2 ( Δ x ) 2 2 V ( x , y , z ) Δ V ( x , y , z ) = 1 2 ( Δ x ) 2 2 V ( x , y , z ) Delta V(x,y,z)=(1)/(2)(:(Delta x)^(2):)grad^(2)V(x,y,z)\Delta V(x, y, z)=\frac{1}{2}\left\langle(\Delta x)^{2}\right\rangle \nabla^{2} V(x, y, z)ΔV(x,y,z)=12(Δx)22V(x,y,z)
The average of this perturbation over the unperturbed motion accounts for the major part of the observed Lamb-Retherford shift Δ E = Δ V ( x , y , z ) Δ E = Δ V ( x , y , z ) Delta E=(:Delta V(x,y,z):)\Delta E=\langle\Delta V(x, y, z)\rangleΔE=ΔV(x,y,z) in the energy level. Conversely, the observation of the expected shift makes the reality of the vacuum fluctuations inescapably evident.
From the zero-point fluctuations of a single oscillator to the fluctuations of the electromagnetic field to geometrodynamic fluctuations is a natural order of progression.
A harmonic oscillator in its ground state has a probability amplitude of
(43.23) ψ ( x ) = ( m ω π ) 1 / 4 e ( m ω / 2 h ) x 2 (43.23) ψ ( x ) = m ω π 1 / 4 e ( m ω / 2 h ) x 2 {:(43.23)psi(x)=((m omega)/(piℏ))^(1//4)e^(-(m omega//2h)x^(2)):}\begin{equation*} \psi(x)=\left(\frac{m \omega}{\pi \hbar}\right)^{1 / 4} e^{-(m \omega / 2 h) x^{2}} \tag{43.23} \end{equation*}(43.23)ψ(x)=(mωπ)1/4e(mω/2h)x2
to be displaced by the distance x x xxx from its natural classical position of equilibrium. In this sense, it may be said to "resonate" or "fluctuate" between locations in space ranging over a region of extent
(43.24) Δ x ( / m ω ) 1 / 2 . (43.24) Δ x ( / m ω ) 1 / 2 . {:(43.24)Delta x∼(ℏ//m omega)^(1//2).:}\begin{equation*} \Delta x \sim(\hbar / m \omega)^{1 / 2} . \tag{43.24} \end{equation*}(43.24)Δx(/mω)1/2.
The electromagnetic field can be treated as an infinite collection of independent "field oscillators," with amplitudes ξ 1 , ξ 2 , ξ 1 , ξ 2 , xi_(1),xi_(2),dots\xi_{1}, \xi_{2}, \ldotsξ1,ξ2,. When the Maxwell field is in its state of lowest energy, the probability amplitude-for the first oscillator to have amplitude ξ 1 ξ 1 xi_(1)\xi_{1}ξ1, and simultaneously the second oscillator to have amplitude ξ 2 ξ 2 xi_(2)\xi_{2}ξ2, the third ξ 3 ξ 3 xi_(3)\xi_{3}ξ3, and so on-is the product of functions of the form (43.23), one for each oscillator. When the scale of amplitudes for each oscillator is suitably normalized, the resulting infinite product takes the form
(43.25) ψ ( ξ 1 , ξ 2 , ) = N exp [ ( ξ 1 2 + ξ 2 2 + ) ] (43.25) ψ ξ 1 , ξ 2 , = N exp ξ 1 2 + ξ 2 2 + {:(43.25)psi(xi_(1),xi_(2),dots)=N exp[-(xi_(1)^(2)+xi_(2)^(2)+cdots)]:}\begin{equation*} \psi\left(\xi_{1}, \xi_{2}, \ldots\right)=N \exp \left[-\left(\xi_{1}^{2}+\xi_{2}^{2}+\cdots\right)\right] \tag{43.25} \end{equation*}(43.25)ψ(ξ1,ξ2,)=Nexp[(ξ12+ξ22+)]
This expression gives the probability amplitude ψ ψ psi\psiψ for a configuration B ( x , y , z ) B ( x , y , z ) B(x,y,z)\boldsymbol{B}(x, y, z)B(x,y,z) of the magnetic field that is described by the Fourier coefficients ξ 1 , ξ 2 , ξ 1 , ξ 2 , xi_(1),xi_(2),dots\xi_{1}, \xi_{2}, \ldotsξ1,ξ2,. One can forgo any mention of these Fourier coefficients if one desires, however, and rewrite (43.25) directly in terms of the magnetic field configuration itself [Wheeler (1962)]:
(43.26) ψ ( B ( x , y , z ) ) = r exp ( B ( x 1 ) B ( x 2 ) 16 π 3 c r 12 2 d 3 x 1 d 3 x 2 ) . (43.26) ψ ( B ( x , y , z ) ) = r exp B x 1 B x 2 16 π 3 c r 12 2 d 3 x 1 d 3 x 2 . {:(43.26)psi(B(x","y","z))=rexp(-∬(B(x_(1))*B(x_(2)))/(16pi^(3)ℏcr_(12)^(2))d^(3)x_(1)d^(3)x_(2)).:}\begin{equation*} \psi(\boldsymbol{B}(x, y, z))=\mathscr{r} \exp \left(-\iint \frac{\boldsymbol{B}\left(\boldsymbol{x}_{1}\right) \cdot \boldsymbol{B}\left(\boldsymbol{x}_{2}\right)}{16 \pi^{3} \hbar c r_{12}^{2}} d^{3} x_{1} d^{3} x_{2}\right) . \tag{43.26} \end{equation*}(43.26)ψ(B(x,y,z))=rexp(B(x1)B(x2)16π3cr122d3x1d3x2).
One no longer speaks of "the" magnetic field, but instead of the probability of this, that, or the other configuration of the magnetic field, even under circumstances, as here, where the electromagnetic field is in its ground state. [See Kuchař (1970) for a similar expression for the "ground state" functional of the linearized gravitational field.]
It is reasonable enough under these circumstances that the configuration of greatest probability is B ( x , y , z ) = 0 B ( x , y , z ) = 0 B(x,y,z)=0\boldsymbol{B}(x, y, z)=0B(x,y,z)=0. Consider for comparison a configuration where the magnetic field is again everywhere zero except in a region of dimension L L LLL. There let the field, subject as always to the condition div B = 0 div B = 0 div B=0\operatorname{div} \boldsymbol{B}=0divB=0, be of the order of magnitude Δ B Δ B Delta B\Delta BΔB. The probability amplitude for this configuration will be reduced relative to the nil configuration by a factor exp ( I ) exp ( I ) exp(-I)\exp (-I)exp(I). Here the quantity I I III in the exponent is of the order ( Δ B ) 2 L 4 / c ( Δ B ) 2 L 4 / c (Delta B)^(2)L^(4)//ℏc(\Delta B)^{2} L^{4} / \hbar c(ΔB)2L4/c. Configurations for which I I III is large compared to 1 occur with negligible probability. Configurations for which I I III is small compared to 1 occur with practically the same probability as the nil configuration. In this sense, one can
Fluctuations for oscillator and for electromagnetic field
say that the fluctuations in the magnetic field in a region of extension L L LLL are of the order of magnitude
(43.27) Δ B ( c ) 1 / 2 L 2 (43.27) Δ B ( c ) 1 / 2 L 2 {:(43.27)Delta B∼((ℏc)^(1//2))/(L^(2)):}\begin{equation*} \Delta B \sim \frac{(\hbar c)^{1 / 2}}{L^{2}} \tag{43.27} \end{equation*}(43.27)ΔB(c)1/2L2
In other words, the field "resonates" between one configuration and another with the range of configurations of significance given by (43.27). Moreover, the smaller is the region of space under consideration, the larger are the field magnitudes that occur with appreciable probability.
Still another familiar way of speaking about electromagnetic field fluctuations gives additional insight relevant to geometrodynamics. One considers a measuring device responsive in comparable measure to the magnetic field at all points in a region of dimension L L LLL. One asks for the effect on this device of electromagnetic disturbances of various wavelengths. A disturbance of wavelength short compared to L L LLL will cause forces to act one way in some parts of the detector, and will give rise to nearly compensating forces in other parts of it. In contrast, a disturbance of a long wavelength λ λ lambda\lambdaλ produces forces everywhere in the same direction, but of a magnitude too low to have much effect. Thus the field, estimated from the equation
( energy of electromagnetic wave of wavelength λ in a domain of volume λ 3 ) ( energy of one quantum of wavelength λ )  energy of electromagnetic   wave of wavelength  λ  in a   domain of volume  λ 3 (  energy of one quantum   of wavelength  λ ) ([" energy of electromagnetic "],[" wave of wavelength "lambda" in a "],[" domain of volume "lambda^(3)])∼((" energy of one quantum ")/(" of wavelength "lambda))\left(\begin{array}{l} \text { energy of electromagnetic } \\ \text { wave of wavelength } \lambda \text { in a } \\ \text { domain of volume } \lambda^{3} \end{array}\right) \sim\binom{\text { energy of one quantum }}{\text { of wavelength } \lambda}( energy of electromagnetic  wave of wavelength λ in a  domain of volume λ3)( energy of one quantum  of wavelength λ)
or
B 2 λ 3 c λ B 2 λ 3 c λ B^(2)lambda^(3)∼(ℏc)/(lambda)B^{2} \lambda^{3} \sim \frac{\hbar c}{\lambda}B2λ3cλ
or
(43.28) B ( c ) 1 / 2 λ 2 (43.28) B ( c ) 1 / 2 λ 2 {:(43.28)B∼((ℏc)^(1//2))/(lambda^(2)):}\begin{equation*} B \sim \frac{(\hbar c)^{1 / 2}}{\lambda^{2}} \tag{43.28} \end{equation*}(43.28)B(c)1/2λ2
is very small if λ λ lambda\lambdaλ is large compared to the domain size L L LLL. The biggest effect is caused by a disturbance of wavelength λ λ lambda\lambdaλ comparable to L L LLL itself. This line of reasoning leads directly from (43.28) to the standard fluctuation formula (43.27).
Similar considerations apply in geometrodynamics. Quantum fluctuations in the geometry are superposed on and coexist with the large-scale, slowly varying curvature predicted by classical deterministic general relativity. Thus, in a region of dimension L L LLL, where in a local Lorentz frame the normal values of the metric coefficients will be 1 , 1 , 1 , 1 1 , 1 , 1 , 1 -1,1,1,1-1,1,1,11,1,1,1, there will occur fluctuations in these coefficients of the order
(43.29) Δ g L L (43.29) Δ g L L {:(43.29)Delta g∼(L^(**))/(L):}\begin{equation*} \Delta g \sim \frac{L^{*}}{L} \tag{43.29} \end{equation*}(43.29)ΔgLL
fluctuations in the first derivatives of the g i k g i k g_(ik)g_{i k}gik 's of the order
(43.30) Δ Γ Δ g L L L 2 (43.30) Δ Γ Δ g L L L 2 {:(43.30)Delta Gamma∼(Delta g)/(L)∼(L^(**))/(L^(2)):}\begin{equation*} \Delta \Gamma \sim \frac{\Delta g}{L} \sim \frac{L^{*}}{L^{2}} \tag{43.30} \end{equation*}(43.30)ΔΓΔgLLL2
and fluctuations in the curvature of space of the order
(43.31) Δ R Δ g L 2 L L 3 (43.31) Δ R Δ g L 2 L L 3 {:(43.31)Delta R∼(Delta g)/(L^(2))∼(L^(**))/(L^(3)):}\begin{equation*} \Delta R \sim \frac{\Delta g}{L^{2}} \sim \frac{L^{*}}{L^{3}} \tag{43.31} \end{equation*}(43.31)ΔRΔgL2LL3
Here
(43.32) L = ( G c 3 ) 1 / 2 = 1.6 × 10 33 cm (43.32) L = G c 3 1 / 2 = 1.6 × 10 33 cm {:(43.32)L^(**)=((ℏG)/(c^(3)))^(1//2)=1.6 xx10^(-33)cm:}\begin{equation*} L^{*}=\left(\frac{\hbar G}{c^{3}}\right)^{1 / 2}=1.6 \times 10^{-33} \mathrm{~cm} \tag{43.32} \end{equation*}(43.32)L=(Gc3)1/2=1.6×1033 cm
is the so-called Planck length [Planck (1899)].
It is appropriate to look at orders of magnitude. The curvature of space within and near the earth, according to classical Einstein theory, is of the order
R ( G c 2 ) ρ ( 0.7 × 10 28 cm / g ) ( 5 g / cm 3 ) (43.33) 4 × 10 28 cm 2 R G c 2 ρ 0.7 × 10 28 cm / g 5 g / cm 3 (43.33) 4 × 10 28 cm 2 {:[R∼((G)/(c^(2)))rho∼(0.7 xx10^(-28)(cm)//g)(5(g)//cm^(3))],[(43.33)∼4xx10^(-28)cm^(-2)]:}\begin{align*} R & \sim\left(\frac{G}{c^{2}}\right) \rho \sim\left(0.7 \times 10^{-28} \mathrm{~cm} / \mathrm{g}\right)\left(5 \mathrm{~g} / \mathrm{cm}^{3}\right) \\ & \sim 4 \times 10^{-28} \mathrm{~cm}^{-2} \tag{43.33} \end{align*}R(Gc2)ρ(0.7×1028 cm/g)(5 g/cm3)(43.33)4×1028 cm2
This quantity has a very direct physical significance. It measures the "tide-producing component of the gravitational field" as sensed, for example, in a freely falling elevator or in a spaceship in free orbit around the earth. By comparison, the quantum fluctuations in the curvature of space are only
(43.34) Δ R 10 33 cm 2 (43.34) Δ R 10 33 cm 2 {:(43.34)Delta R∼10^(-33)cm^(-2):}\begin{equation*} \Delta R \sim 10^{-33} \mathrm{~cm}^{-2} \tag{43.34} \end{equation*}(43.34)ΔR1033 cm2
even in a domain of observation as small as 1 cm in extent. Thus the quantum fluctuations in the geometry of space are completely negligible under everyday circumstances.
Even in atomic and nuclear physics the fluctuations in the metric,
Δ g 10 33 cm 10 8 cm 10 25 Δ g 10 33 cm 10 8 cm 10 25 Delta g∼(10^(-33)(cm))/(10^(-8)(cm))∼10^(-25)\Delta g \sim \frac{10^{-33} \mathrm{~cm}}{10^{-8} \mathrm{~cm}} \sim 10^{-25}Δg1033 cm108 cm1025
and
(43.35) Δ g 10 33 cm 10 13 cm 10 20 (43.35) Δ g 10 33 cm 10 13 cm 10 20 {:(43.35)Delta g∼(10^(-33)(cm))/(10^(-13)(cm))∼10^(-20):}\begin{equation*} \Delta g \sim \frac{10^{-33} \mathrm{~cm}}{10^{-13} \mathrm{~cm}} \sim 10^{-20} \tag{43.35} \end{equation*}(43.35)Δg1033 cm1013 cm1020
are so small that it is completely in order to idealize the physics as taking place in a flat Lorentzian spacetime manifold.
The quantum fluctuations in the geometry are nevertheless inescapable, if one is to believe the quantum principle and Einstein's theory. They coexist with the geometrodynamic development predicted by classical general relativity. The fluctuations widen the narrow swathe cut through superspace by the classical history of the geometry. In other words, the geometry is not deterministic, even though it looks so at the everyday scale of observation. Instead, at a submicroscopic scale it "resonates" between one configuration and another and another. This terminology means no more and no less than the following: (1) Each configuration ( 3 ) ( 3 ) ^((3)){ }^{(3)}(3) & has its own probability amplitude ψ = ψ ( ( 3 ) ) ψ = ψ ( 3 ) psi=psi(^((3))ℓ)\psi=\psi\left({ }^{(3)} \ell\right)ψ=ψ((3)). (2) These probability amplitudes have comparable magnitudes for a whole range of 3-geometries included within the limits (43.29) on
either side of the classical swathe through superspace. (3) This range of 3-geometries is far too variegated on the submicroscopic sale to fit into any one 4 -geometry, or any one classical geometrodynamic history. (4) Only when one overlooks these small-scale fluctuations ( 10 33 cm 10 33 cm ∼10^(-33)cm\sim 10^{-33} \mathrm{~cm}1033 cm ) and examines the larger-scale features of the 3-geometries do they appear to fit into a single space-time manifold, such as comports with the classical field equations.
These small-scale fluctuations tell one that something like gravitational collapse is taking place everywhere in space and all the time; that gravitational collapse is in effect perpetually being done and undone; that in addition to the gravitational collapse of the universe, and of a star, one has also to deal with a third and, because it is constantly being undone, most significant level of gravitational collapse at the Planck scale of distances.

EXERCISES

Exercise 43.1. THE ACTION PRINCIPLE FOR A FREE PARTICLE IN NONRELATIVISTIC MECHANICS

Taking as action principle I = L d t = I = L d t = I=int Ldt=I=\int L d t=I=Ldt= extremum, with specified x , t x , t x^('),t^(')x^{\prime}, t^{\prime}x,t and x , t x , t x^(''),t^('')x^{\prime \prime}, t^{\prime \prime}x,t at the two limits, and with L = 1 2 m ( d x / d t ) 2 L = 1 2 m ( d x / d t ) 2 L=(1)/(2)m(dx//dt)^(2)L=\frac{1}{2} m(d x / d t)^{2}L=12m(dx/dt)2, find (1) the extremizing history x = x ( t ) x = x ( t ) x=x(t)x=x(t)x=x(t) and (2) the dynamical path length or action S ( x , t ; x , t ) = I extremum S x , t ; x , t = I extremum  S(x^(''),t^('');x^('),t^('))=I_("extremum ")S\left(x^{\prime \prime}, t^{\prime \prime} ; x^{\prime}, t^{\prime}\right)=I_{\text {extremum }}S(x,t;x,t)=Iextremum  in its dependence on the end points. Also (3) write down the Hamilton-Jacobi equation for this problem, and (4) verify that S ( x , t S ( x , t S(x,tS(x, tS(x,t; x , t ) x , t {:x^('),t^('))\left.x^{\prime}, t^{\prime}\right)x,t) satisfies this equation. Finally, imagining the Hamilton-Jacobi equation not to be known, (5) derive it from the already known properties of the function S S SSS itself.

Exercise 43.2. THE ACTION FOR THE HARMONIC OSCILLATOR

The kinetic energy is 1 2 m ( d x / d t ) 2 1 2 m ( d x / d t ) 2 (1)/(2)m(dx//dt)^(2)\frac{1}{2} m(d x / d t)^{2}12m(dx/dt)2 and the potential energy is 1 2 m ω 2 x 2 1 2 m ω 2 x 2 (1)/(2)momega^(2)x^(2)\frac{1}{2} m \omega^{2} x^{2}12mω2x2. Carry through the analysis of parts (1), (2), (3), (4) of the preceding exercise. Partial answer:
S = m ω 2 ( x 2 + x 2 ) cos ω ( t t ) 2 x x sin ω ( t t ) S = m ω 2 x 2 + x 2 cos ω t t 2 x x sin ω t t S=(m omega)/(2)((x^(2)+x^('2))cos omega(t-t^('))-2xx^('))/(sin omega(t-t^(')))S=\frac{m \omega}{2} \frac{\left(x^{2}+x^{\prime 2}\right) \cos \omega\left(t-t^{\prime}\right)-2 x x^{\prime}}{\sin \omega\left(t-t^{\prime}\right)}S=mω2(x2+x2)cosω(tt)2xxsinω(tt)
Verify that S / x S / x del S//del x\partial S / \partial xS/x gives momentum and S / t S / t -del S//del t-\partial S / \partial tS/t gives energy.

Exercise 43.3. QUANTUM PROPAGATOR FOR HARMONIC OSCILLATOR

Show that the probability amplitude for a simple harmonic oscillator to transit from x , t x , t x^('),t^(')x^{\prime}, t^{\prime}x,t to x , t x , t x^(''),t^('')x^{\prime \prime}, t^{\prime \prime}x,t is
x , t ; x , t = ( m ω 2 π i sin ω ( t t ) ) 1 / 2 × exp i m ω [ ( x 2 + x 2 ) cos ω ( t t ) 2 x x ] 2 sin ω ( t t ) , x , t ; x , t = m ω 2 π i sin ω t t 1 / 2 × exp i m ω x 2 + x 2 cos ω t t 2 x x 2 sin ω t t , {:[(:x^(''),t^('');x^('),t^('):)],[quad=((m omega)/(2pi iℏsin omega(t^('')-t^('))))^(1//2)xx exp((im omega[(x^(''2)+x^('2))cos omega(t^('')-t^('))-2x^('')x^(')])/(2ℏsin omega(t^('')-t^('))))","]:}\begin{aligned} & \left\langle x^{\prime \prime}, t^{\prime \prime} ; x^{\prime}, t^{\prime}\right\rangle \\ & \quad=\left(\frac{m \omega}{2 \pi i \hbar \sin \omega\left(t^{\prime \prime}-t^{\prime}\right)}\right)^{1 / 2} \times \exp \frac{i m \omega\left[\left(x^{\prime \prime 2}+x^{\prime 2}\right) \cos \omega\left(t^{\prime \prime}-t^{\prime}\right)-2 x^{\prime \prime} x^{\prime}\right]}{2 \hbar \sin \omega\left(t^{\prime \prime}-t^{\prime}\right)}, \end{aligned}x,t;x,t=(mω2πisinω(tt))1/2×expimω[(x2+x2)cosω(tt)2xx]2sinω(tt),
and that it reduces for the case of a free particle to
x , t ; x , t = ( m 2 π i ( t t ) ) 1 / 2 exp i m ( x x ) 2 2 ( t t ) x , t ; x , t = m 2 π i t t 1 / 2 exp i m x x 2 2 t t (:x^(''),t^('');x^('),t^('):)=((m)/(2pi iℏ(t^('')-t^('))))^(1//2)exp((im(x^('')-x^('))^(2))/(2ℏ(t^('')-t^('))))\left\langle x^{\prime \prime}, t^{\prime \prime} ; x^{\prime}, t^{\prime}\right\rangle=\left(\frac{m}{2 \pi i \hbar\left(t^{\prime \prime}-t^{\prime}\right)}\right)^{1 / 2} \exp \frac{i m\left(x^{\prime \prime}-x^{\prime}\right)^{2}}{2 \hbar\left(t^{\prime \prime}-t^{\prime}\right)}x,t;x,t=(m2πi(tt))1/2expim(xx)22(tt)
Note that one can derive all the harmonic-oscillator wave functions from the solution by use of the formula
x , t ; x , t = n u n ( x ) u n ( x ) exp i E n ( t t ) / x , t ; x , t = n u n x u n x exp i E n t t / (:x^(''),t^('');x^('),t^('):)=sum_(n)u_(n)(x^(''))u_(n)^(**)(x^('))exp iE_(n)(t^(')-t^(''))//ℏ\left\langle x^{\prime \prime}, t^{\prime \prime} ; x^{\prime}, t^{\prime}\right\rangle=\sum_{n} u_{n}\left(x^{\prime \prime}\right) u_{n}^{*}\left(x^{\prime}\right) \exp i E_{n}\left(t^{\prime}-t^{\prime \prime}\right) / \hbarx,t;x,t=nun(x)un(x)expiEn(tt)/

Exercise 43.4. QUANTUM PROPAGATOR FOR FREE ELECTROMAGNETIC FIELD

In flat spacetime, one is given on the spacelike hypersurface t = t t = t t=t^(')t=t^{\prime}t=t the divergence-free magnetic field B ( x , y , z ) B ( x , y , z ) B^(')(x,y,z)B^{\prime}(x, y, z)B(x,y,z) and on the spacelike hypersurface t = t t = t t=t^('')t=t^{\prime \prime}t=t the divergence-free magnetic field B ( x , y , z ) B ( x , y , z ) B^('')(x,y,z)B^{\prime \prime}(x, y, z)B(x,y,z). By Fourier analysis (reducing this problem to the preceding problem) or otherwise, find the probability amplitude to transit from B B B^(')B^{\prime}B at t t t^(')t^{\prime}t to B B B^('')B^{\prime \prime}B at t t t^('')t^{\prime \prime}t.

Exercise 43.5. HAMILTON-JACOBI FORMULATION OF MAXWELL ELECTRODYNAMICS

Regard the four components A μ A μ A_(mu)A_{\mu}Aμ of the electromagnetic 4-potential as the primary quantities; split them into a space part A i A i A_(i)A_{i}Ai and a scalar potential ϕ ϕ phi\phiϕ. (1) Derive from the action principle (in flat spacetime)
I = ( 1 / 8 π ) ( E 2 B 2 ) d 4 x I = ( 1 / 8 π ) E 2 B 2 d 4 x I=(1//8pi)int(E^(2)-B^(2))d^(4)xI=(1 / 8 \pi) \int\left(E^{2}-B^{2}\right) d^{4} xI=(1/8π)(E2B2)d4x
by splitting off an appropriate divergence, an expression qualitatively similar in character to (43.7). (2) Show that the appropriate quantity to be fixed on the initial and final spacelike hypersurface is not really A i A i A_(i)A_{i}Ai itself, but the magnetic field, defined by B = curl A B = curl A B=curl A\boldsymbol{B}=\operatorname{curl} \boldsymbol{A}B=curlA. (3) Derive the Hamilton-Jacobi equation for the dynamic phase or action S ( B , S ) S ( B , S ) S(B,S)S(\boldsymbol{B}, S)S(B,S) in its dependence on the choice of hypersurface S S SSS, and the choice of magnetic field B B B\boldsymbol{B}B on this hypersurface,
δ S δ Ω = 1 8 π B 2 + ( 4 π ) 2 8 π ( δ S δ A ) 2 δ S δ Ω = 1 8 π B 2 + ( 4 π ) 2 8 π δ S δ A 2 -(delta S)/(delta Omega)=(1)/(8pi)B^(2)+((4pi)^(2))/(8pi)((delta S)/(delta A))^(2)-\frac{\delta S}{\delta \Omega}=\frac{1}{8 \pi} \boldsymbol{B}^{2}+\frac{(4 \pi)^{2}}{8 \pi}\left(\frac{\delta S}{\delta \boldsymbol{A}}\right)^{2}δSδΩ=18πB2+(4π)28π(δSδA)2
The quantity on the left is Tomonaga's "bubble time" derivative [Tomonaga (1946); see also Box 21.1].

44 44 -44-4444

BEYOND THE END OF TIME

"Heaven wheels above youDisplaying to you her eternal glories And still your eyes are on the ground"

DANTE
The world "stands before us as a great, eternal riddle"
EINSTEIN (1949a)
This chapter is entirely Track 2. No previous Track-2 material is needed as preparation for it, but Chapter 43 will be helpful.

§44.1. GRAVITATIONAL COLLAPSE AS THE GREATEST CRISIS IN PHYSICS OF ALL TIME

The universe starts with a big bang, expands to a maximum dimension, then recontracts and collapses: no more awe-inspiring prediction was ever made. It is preposterous. Einstein himself could not believe his own prediction. It took Hubble's observations to force him and the scientific community to abandon the concept of a universe that endures from everlasting to everlasting.
Later work of Tolman (1934a), Avez (1960), Geroch (1967), and Hawking and Penrose (1969) generalizes the conclusion. A model universe that is closed, that obeys Einstein's geometrodynamic law, and that contains a nowhere negative density of mass-energy, inevitably develops a singularity. No one sees any escape from the density of mass-energy rising without limit. A computing machine calculating ahead step by step the dynamical evolution of the geometry comes to the point where it can not go on. Smoke, figuratively speaking, starts to pour out of the computer. Yet physics surely continues to go on if for no other reason than this: Physics is by definition that which does go on its eternal way despite all the shadowy changes in the surface appearance of reality.
Some day a door will surely open and expose the glittering central mechanism of the world in its beauty and simplicity. Toward the arrival of that day, no development holds out more hope than the paradox of gravitational collapse. Why paradox? Because Einstein's equation says "this is the end" and physics says "there is no end." Why hope? Because among all paradigms for probing a puzzle, physics proffers none with more promise than a paradox.
No previous period of physics brought a greater paradox than 1911 (Box 44.1). Rutherford had just been forced to conclude that matter is made up of localized positive and negative charges. Matter as so constituted should undergo electric collapse in 10 17 sec 10 17 sec ∼10^(-17)sec\sim 10^{-17} \mathrm{sec}1017sec, according to theory. Observation equally clearly proclaimed that matter is stable. No one took the paradox more seriously than Bohr. No one worked around and around the central mystery with more energy wherever work was possible. No one brought to bear a more judicious combination of daring and conservativeness, nor a deeper feel for the harmony of physics. The direct opposite
The paradox of collapse: physics stops, but physics must go on
The 1911 crisis of electric collapse
Box 44.1 COLLAPSE OF UNIVERSE PREDICTED BY CLASSICAL THEORY, COMPARED AND CONTRASTED WITH CLASSICALLY PREDICTED COLLAPSE OF ATOM
System Atom (1911) Universe (1970's)
Dynamic entity System of electrons Geometry of space
Nature of classi-
cally predicted
collapse
Nature of classi- cally predicted collapse| Nature of classi- | | :---: | | cally predicted | | collapse |
Electron headed toward point-center of
attraction is driven in a finite time
to infinite energy
Electron headed toward point-center of attraction is driven in a finite time to infinite energy| Electron headed toward point-center of | | :---: | | attraction is driven in a finite time | | to infinite energy |
Not only matter but space itself arrives
in a finite proper time at a condition
of infinite compaction
Not only matter but space itself arrives in a finite proper time at a condition of infinite compaction| Not only matter but space itself arrives | | :---: | | in a finite proper time at a condition | | of infinite compaction |
One rejected "way
out"
One rejected "way out"| One rejected "way | | :--- | | out" |
Give up Coulomb law of force Give up Einstein's field equation
System Atom (1911) Universe (1970's) Dynamic entity System of electrons Geometry of space "Nature of classi- cally predicted collapse" "Electron headed toward point-center of attraction is driven in a finite time to infinite energy" "Not only matter but space itself arrives in a finite proper time at a condition of infinite compaction" "One rejected "way out"" Give up Coulomb law of force Give up Einstein's field equation| System | Atom (1911) | Universe (1970's) | | :---: | :--- | :--- | | Dynamic entity | System of electrons | Geometry of space | | Nature of classi- <br> cally predicted <br> collapse | Electron headed toward point-center of <br> attraction is driven in a finite time <br> to infinite energy | Not only matter but space itself arrives <br> in a finite proper time at a condition <br> of infinite compaction | | One rejected "way <br> out" | Give up Coulomb law of force | Give up Einstein's field equation |
of harmony, cacophony, is the impression that comes from sampling the literature of the 'teens on the structure of the atom. (1) Change the Coulomb law of force between electric charges? (2) Give up the principle that an accelerated charge must radiate? There was little inhibition against this and still wilder abandonings of the well-established. In contrast, Bohr held fast to these two principles. At the same time he insisted on the importance of a third principle, firmly established by Planck in quite another domain of physics, the quantum principle. With that key he opened the door to the world of the atom.
Great as was the crisis of 1911, today gravitational collapse confronts physics with its greatest crisis ever. At issue is the fate, not of matter alone, but of the universe itself. The dynamics of collapse, or rather of its reverse, expansion, is evidenced not by theory alone, but also by observation; and not by one observation, but by observations many in number and carried out by astronomers of unsurpassed ability and integrity. Collapse, moreover, is not unique to the large-scale dynamics of the universe. A white dwarf star or a neutron star of more than critical mass is predicted to undergo gravitational collapse to a black hole (Chapters 32 and 33 ). Sufficiently many stars falling sufficiently close together at the center of the nucleus of a galaxy are expected to collapse to a black hole many powers of ten more massive than the sun. An active search is under way for evidence for a black hole in this Galaxy (Box 33.3). The process that makes a black hole is predicted to provide an experimental model for the gravitational collapse of the universe itself, with one difference. For collapse to a black hole, the observer has his choice whether (1) to observe from a safe distance, in which case his observations will reveal nothing of what goes on inside the horizon; or (2) to follow the falling matter on in, in which case he sees the final stages of the collapse, not only of the matter itself, but of the geometry surrounding the matter, to indefinitely high compaction, but only at the cost of his own early demise. For the gravitational collapse of a closed model universe, no such choice is available to the observer. His fate is sealed. So too is the fate of matter and elementary particles, driven up to indefinitely high densities. The stakes in the crisis of collapse are hard to match: the dynamics of the largest object, space, and the smallest object, an elementary particle-and how both began.

§44.2. ASSESSMENT OF THE THEORY THAT PREDICTS COLLAPSE

No one reflecting on the paradox of collapse ("collapse ends physics"; "collapse cannot end physics") can fail to ask, "What are the limits of validity of Einstein's geometric theory of gravity?" A similar question posed itself in the crisis of 1911. The Coulomb law for the force acting between two charges had been tested at distances of meters and millimeters, but what warrant was there to believe that it holds down to the 10 8 cm 10 8 cm 10^(-8)cm10^{-8} \mathrm{~cm}108 cm of atomic dimensions? Of course, in the end it proves to hold not only at the level of the atom, and at the 10 13 cm 10 13 cm 10^(-13)cm10^{-13} \mathrm{~cm}1013 cm level of the nucleus, but even down to 5 × 10 15 cm 5 × 10 15 cm 5xx10^(-15)cm5 \times 10^{-15} \mathrm{~cm}5×1015 cm [Barber, Gittelman, O'Neill, and Richter, and Bailey et al. (1968), as reviewed by Farley (1969) and Brodsky and Drell (1970)], a striking
example of what Wigner (1960) calls the "unreasonable effectiveness of mathematics in the natural sciences."
No theory more resembles Maxwell's electrodynamics in its simplicity, beauty, and scope than Einstein's geometrodynamics. Few principles in physics are more firmly established than those on which it rests: the local validity of special relativity (Chapters 2-7), the equivalence principle (Chapter 16), the conservation of momentum and energy (Chapters 5, 15 and 16), and the prevalence of second-order field equations throughout physics (Chapter 17). Those principles and the demand for no "extraneous fields" (e.g., Dicke’s scalar field) and "no prior geometry" (§17.6) lead to the conclusion that the geometry of spacetime must be Riemannian and the geometrodynamic law must be Einstein's.
To say that the geometry is Riemannian is to say that the interval between any two nearby events C C CCC and D D DDD, anywhere in spacetime, stated in terms of the interval A B A B ABA BAB between two nearby fiducial events, at quite another point in spacetime, has a value C D / A B C D / A B CD//ABC D / A BCD/AB independent of the route of intercomparison (Chapter 13 and Box 16.4). There are a thousand routes. By this hydraheaded prediction, Einstein's theory thus exposes itself to destruction in a thousand ways (Box 16.4).
Geometrodynamics lends itself to being disproven in other ways as well. The geometry has no option about the control it exerts on the dynamics of particles and fields (Chapter 20). The theory makes predictions about the equilibrium configurations and pulsations of compact stars (Chapters 23-26). It gives formulas (Chapters 27-29) for the deceleration of the expansion of the universe, for the density of mass-energy, and for the magnifying power of the curvature of space, the tests of which are not far off. It predicts gravitational collapse, and the existence of black holes, and a wealth of physics associated with these objects (Chapters 31-34). It predicts gravitational waves (Chapters 35-37). In the appropriate approximation, it encompasses all the well-tested predictions of the Newtonian theory of gravity for the dynamics of the solar system, and predicts testable post-Newtonian corrections besides, including several already verified effects (Chapters 38-40).
No inconsistency of principle has ever been found in Einstein's geometric theory of gravity. No purported observational evidence against the theory has ever stood the test of time. No other acceptable account of physics of comparable simplicity and scope has ever been put forward.
Continue this assessment of general relativity a little further before returning to the central issue, the limits of validity of the theory and their bearing on the issue of gravitational collapse. What has Einstein's geometrodynamics contributed to the understanding of physics?
First, it has dethroned spacetime from a post of preordained perfection high above the battles of matter and energy, and marked it as a new dynamic entity participating actively in this combat.
Second, by tying energy and momentum to the curvature of spacetime, Einstein's theory has recognized the law of conservation of momentum and energy as an automatic consequence of the geometric identity that the boundary of a boundary is zero (Chapters 15 and 17).
Third, it has recognized gravitation as a manifestation of the curvature of the
New view of nature flowing
from Einstein's
geometrodynamics
Electric charge as lines of force trapped in the topology of space
geometry of spacetime rather than as something foreign and "physical" imbedded in spacetime.
Fourth, general relativity has reinforced the view that "physics is local"; that the analysis of physics becomes simple when it connects quantities at a given event with quantities at immediately adjacent events.
Fifth, obedient to the quantum principle, it recognizes that spacetime and time itself are ideas valid only at the classical level of approximation; that the proper arena for the Einstein dynamics of geometry is not spacetime, but superspace; and that this dynamics is described in accordance with the quantum principle by the propagation of a probability amplitude through superspace (Chapter 43). In consequence, the geometry of space is subject to quantum fluctuations in metric coefficients of the order
δ g ( Planck length, L = ( G / c 3 ) 1 / 2 = 1.6 × 10 33 cm ) ( linear extension of region under study ) δ g  Planck length,  L = G / c 3 1 / 2 = 1.6 × 10 33 cm (  linear extension of region under study  ) delta g∼((" Planck length, "L^(**)=(ℏG//c^(3))^(1//2)=1.6 xx10^(-33)(cm)))/((" linear extension of region under study "))\delta g \sim \frac{\left(\text { Planck length, } L^{*}=\left(\hbar G / c^{3}\right)^{1 / 2}=1.6 \times 10^{-33} \mathrm{~cm}\right)}{(\text { linear extension of region under study })}δg( Planck length, L=(G/c3)1/2=1.6×1033 cm)( linear extension of region under study )
Sixth, standard Einstein geometrodynamics is partial as little to Euclidean topology as to Euclidean geometry. A multiply connected topology provides a natural description for electric charge as electric lines of force trapped in the topology of a multiply connected space (Figure 44.1). Any other description of electricity postulates a breakdown in Maxwell's field equations for the vacuum at a site where charge
Figure 44.1.
Electric charge viewed as electric lines of force trapped in the topology of a multiply connected space [for the history of this concept see reference 36 of Wheeler (1968a)]. The wormhole or handle is envisaged as connecting two very different regions in the same space. One of the wormhole mouths, viewed by an observer with poor resolving power, appears to be the seat of an electric charge. Out of this region of 3 -space he finds lines of force emerging over the whole 4 π 4 π 4pi4 \pi4π solid angle. He may construct a boundary around this charge, determine the flux through this boundary, incorrectly apply the theorem of Gauss and "prove" that there is a charge "inside the boundary." It isn't a boundary. Someone caught within it-to speak figuratively-can go into that mouth of the wormhole, through the throat, out the other mouth, and return by way of the surrounding space to look at his "prison" from the outside. Lines of force nowhere end. Maxwell's equations nowhere fail. Nowhere can one place a finger and say, "Here there is some charge." This classical type of electric charge has no direct relation whatsoever to quantized electric charge. There is a freedom of choice about the flux through the wormhole, and a specificity about the connection between one charge and another, which is quite foreign to the charges of elementary particle physics. For ease of visualization the number of space dimensions in the above diagram has been reduced from three to two. The third dimension, measured off the surface, has no physical meaning-it only provides an extra dimension in which to imbed the surface for more convenient diagrammatic representation. [For more detail see Misner and Wheeler (1957), reprinted in Wheeler (1962)].
is located, or postulates the existence of some foreign and "physical" electric jelly imbedded in space, or both. No one has ever found a way to describe electricity free of these unhappy features except to say that the quantum fluctuations in the geometry of space are so great at small distances that even the topology fluctuates, makes "wormholes," and traps lines of force. These fluctuations have to be viewed, not as tied to particles, and endowed with the scale of distances associated with particle physics ( 10 13 cm ) 10 13 cm (∼10^(-13)(cm))\left(\sim 10^{-13} \mathrm{~cm}\right)(1013 cm) but as pervading all space ("foam-like structure of geometry") and characterized by the Planck distance ( 10 33 cm 10 33 cm ∼10^(-33)cm\sim 10^{-33} \mathrm{~cm}1033 cm ). Thus a third type of gravitational collapse forces itself on one's attention, a collapse continually being done and being undone everywhere in space: surely a guide to the outcome of collapse at the level of a star and at the level of the universe (Box 44.2).

Box 44.2 THREE LEVELS OF GRAVITATIONAL COLLAPSE

  1. Universe
  2. Black hole
  3. Fluctuations at the Planck scale of distances
Recontraction and collapse of the universe is a kind of mirror image of the "big bang," on which one already has so much evidence.
Collapse of matter to form a black hole is most natural at two distinct levels: (a) collapse of the dense white-dwarf core of an individual star (when that core exceeds the critical mass, 1 M 1 M ∼1M_(o.)\sim 1 M_{\odot}1M or 2 M 2 M 2M_(o.)2 M_{\odot}2M, at which a neutron star is no longer a possible stable end-point for collapse) and (b) coalescence one by one of the stars in a galactic nucleus to make a black hole of mass up to 10 6 M 10 6 M 10^(6)M_(o.)10^{6} M_{\odot}106M or even 10 9 M 10 9 M 10^(9)M_(o.)10^{9} M_{\odot}109M.
In either case, no feature of principle about matter falling into the black hole is more interesting than the option that the observer has (symbolized by the branching arrow in the inset). He can go along with the infalling matter, in which case he sees the final stages of collapse, but only at the cost of his own demise. Or he can stay safely outside, in which case even after in-
definitely long time he sees only the first part of the collapse, with the infalling matter creeping up more and more slowly to the horizon.
In the final stages of the collapse of a closed model universe, all black holes present are caught up and driven together, amalgamating one by one. No one has any way to look at the event from safely outside; one is inevitably caught up in it oneself.
Collapse at the Planck scale of distances is taking place everywhere and all the time in quantum fluctuations in the geometry and, one believes, the topology of space. In this sense, collapse is continually being done and undone, modeling the undoing of the collapse of the universe itself, summarized in the term, "the reprocessing of the universe" (see text).

§44.3. VACUUM FLUCTUATIONS: THEIR PREVALENCE AND FINAL DOMINANCE

Is matter built out of geometry?
The richness of the physics of the vacuum
If Einstein's theory thus throws light on the rest of physics, the rest of physics also throws light on geometrodynamics. No point is more central than this, that empty space is not empty. It is the seat of the most violent physics. The electromagnetic field fluctuates (Chapter 43). Virtual pairs of positive and negative electrons, in effect, are continually being created and annihilated, and likewise pairs of mu mesons, pairs of baryons, and pairs of other particles. All these fluctuations coexist with the quantum fluctuations in the geometry and topology of space. Are they additional to those geometrodynamic zero-point disturbances, or are they, in some sense not now well-understood, mere manifestations of them?
Put the question in other words. Recall Clifford, inspired by Riemann, speaking to the Cambridge Philosophical Society on February 21, 1870, "On the Space Theory of Matter" [Clifford (1879), pp. 244 and 322; and (1882), p. 21], and saying, "I hold in fact (1) That small portions of space are in fact of a nature analogous to little hills on a surface which is on the average flat; namely, that the ordinary laws of geometry are not valid in them. (2) That this property of being curved or distorted is continually being passed on from one portion of space to another after the manner of a wave. (3) That this variation of the curvature of space is what really happens in that phenomenon which we call the motion of matter, whether ponderable or etherial. (4) That in the physical world nothing else takes place but this variation, subject (possibly) to the law of continuity." Ask if there is a sense in which one can speak of a particle as constructed out of geometry. Or rephrase the question in updated language: "Is a particle a geometrodynamic exciton?" What else is there out of which to build a particle except geometry itself? And what else is there to give discreteness to such an object except the quantum principle?
The Clifford-Einstein space theory of matter has not been forgotten in recent years. "In conclusion," one of the authors wrote a decade ago [Wheeler (1962)], "the vision of Riemann, Clifford, and Einstein, of a purely geometric basis for physics, today has come to a higher state of development, and offers richer prospects-and presents deeper problems-than ever before. The quantum of action adds to this geometrodynamics new features, of which the most striking is the presence of fluctuations of the wormhole type throughout all space. If there is any correspondence at all between this virtual foam-like structure and the physical vacuum as it has come to be known through quantum electrodynamics, then there seems to be no escape from identifying these wormholes with 'undressed electrons.' Completely different from these 'undressed electrons,' according to all available evidence, are the electrons and other particles of experimental physics. For these particles the geometrodynamic picture suggests the model of collective disturbances in a virtual foam-like vacuum, analogous to different kinds of phonons or excitons in a solid.
"The enormous factor from nuclear densities 10 14 g / cm 3 10 14 g / cm 3 ∼10^(14)g//cm^(3)\sim 10^{14} \mathrm{~g} / \mathrm{cm}^{3}1014 g/cm3 to the density of field fluctuation energy in the vacuum, 10 94 g / cm 3 10 94 g / cm 3 ∼10^(94)g//cm^(3)\sim 10^{94} \mathrm{~g} / \mathrm{cm}^{3}1094 g/cm3, argues that elementary particles represent a percentage-wise almost completely negligible change in the locally violent conditions that characterize the vacuum. ['A particle ( 10 14 g / cm 3 ) 10 14 g / cm 3 (10^(14)(g)//cm^(3))\left(10^{14} \mathrm{~g} / \mathrm{cm}^{3}\right)(1014 g/cm3) means as little
to the physics of the vacuum ( 10 94 g / cm 3 ) 10 94 g / cm 3 (10^(94)(g)//cm^(3))\left(10^{94} \mathrm{~g} / \mathrm{cm}^{3}\right)(1094 g/cm3) as a cloud ( 10 6 g / cm 3 ) 10 6 g / cm 3 (10^(-6)(g)//cm^(3))\left(10^{-6} \mathrm{~g} / \mathrm{cm}^{3}\right)(106 g/cm3) means to the physics of the sky ( 10 3 g / cm 3 ) . ] 10 3 g / cm 3 . {:(10^(-3)(g)//cm^(3)).^(')]\left.\left(10^{-3} \mathrm{~g} / \mathrm{cm}^{3}\right) .{ }^{\prime}\right](103 g/cm3).] In other words, elementary particles do not form a really basic starting point for the description of nature. Instead, they represent a first-order correction to vacuum physics. That vacuum, that zero-order state of affairs, with its enormous densities of virtual photons and virtual positive-negative pairs and virtual wormholes, has to be described properly before one has a fundamental starting point for a proper perturbation-theoretic analysis."
"These conclusions about the energy density of the vacuum, its complicated topological character, and the richness of the physics which goes on in the vacuum, stand in no evident contradiction with what quantum electrodynamics has to say about the vacuum. Instead the conclusions from the 'small distance' analysis ( 10 33 cm ) 10 33 cm (10^(-33)(cm))\left(10^{-33} \mathrm{~cm}\right)(1033 cm)-sketchy as it is-and from 'larger distance' analysis ( 10 11 cm ) 10 11 cm (10^(-11)(cm))\left(10^{-11} \mathrm{~cm}\right)(1011 cm) would seem to [be able] to reinforce each other in a most natural way.
"The most evident shortcoming of the geometrodynamic model as it stands is this, that it fails to supply any completely natural place for spin 1 2 1 2 (1)/(2)\frac{1}{2}12 in general and for the neutrino in particular."
Attempts to find a natural place for spin 1 2 1 2 (1)/(2)\frac{1}{2}12 in Einstein's standard geometrodynamics (Box 44.3) founder because there is no natural way for a change in connectivity to take place within the context of classical differential geometry.
A uranium nucleus undergoing fission starts with one topology and nevertheless ends up with another topology. It makes this transition in a perfectly continuous way, classical differential geometry notwithstanding.
There are reputed to be two kinds of lawyers. One tells the client what not to do. The other listens to what the client has to do and tells him how to do it. From the first lawyer, classical differential geometry, the client goes away disappointed, still searching for a natural way to describe quantum fluctuations in the connectivity of space. Only in this way can he hope to describe electric charge as lines of electric force trapped in the topology of space. Only in this way does he expect to be able to understand and analyze the final stages of gravitational collapse. Pondering his problems, he comes to the office of a second lawyer, with the name "Pregeometry" on the door. Full of hope, he knocks and enters. What is pregeometry to be and say? Born of a combination of hope and need, of philosophy and physics and mathematics and logic, pregeometry will tell a story unfinished at this writing, but full of incidents of evolution so far as it goes.

§44.4. NOT GEOMETRY, BUT PREGEOMETRY, AS THE MAGIC BUILDING MATERIAL

An early survey (Box 44.4) asked whether geometry can be constructed with the help of the quantum principle out of more basic elements, that do not themselves have any specific dimensionality.
The focus of attention in this 1964 discussion was "dimensionality without dimensionality." However, the prime pressures to ponder pregeometry were and remain
No place in geometrodynamics for change of topology; therefore turn to "pregeometry"

Box 44.3 THE DIFFICULTIES WITH ATTEMPTS TO FIND A NATURAL PLACE FOR SPIN 1 2 1 2 (1)/(2)\frac{1}{2}12 IN EINSTEIN'S STANDARD GEOMETRODYNAMICS

"It is impossible" [Wheeler (1962)] "to accept any description of elementary particles that does not have a place for spin 1 2 spin 1 2 spin(1)/(2)\operatorname{spin} \frac{1}{2}spin12. What, then, has any purely geometric description to offer in explanation of spin 1 2 spin 1 2 spin(1)/(2)\operatorname{spin} \frac{1}{2}spin12 in general? More particularly and more importantly, what place is there in quantum geometrodynamics for the neutrino-the only entity of half-integral spin that is a pure field in its own right, in the sense that it has zero rest mass and moves with the speed of light? No clear or satisfactory answer is known to this question today. Unless and until an answer is forthcoming, pure geometrodynamics must be judged deficient as a basis for elementary particle physics."
A later publication [Wheeler (1968a)] takes up this issue again, noting that, "A new world opens out for analysis in quantum geometrodynamics. The central new concept is space resonating between one foamlike structure and another. For this multiple-connectedness of space at submicroscopic distances no single feature of nature speaks more powerfully than electric charge. Yet at least as impressive as charge is the prevalence of spin 1 2 1 2 (1)/(2)\frac{1}{2}12 throughout the world of elementary particles."
Repeating the statement that "It is impossible to accept any description of elementary particles that does not have a place for spin 1 2 1 2 (1)/(2)\frac{1}{2}12," the article adds to the discussion a new note: "Happily, the concept of spin manifold has come to light, not least through the work of John Milnor [see Lichnerowicz (1961a,b,c) and (1964); Milnor (1962), (1963), and (1965a,b); Hsiang and Anderson (1965); Anderson, Brown, and Peterson (1966a,b); and Penrose (1968a)]. This concept suggests a new and interesting interpretation of a spinor field within the context of the resonating microtopology of quantum geometrodynamics, as the nonclassical two-valuedness [Pauli's standard term for spin; see, for example, Pauli (1947)] that attaches to the probability amplitude for otherwise identical 3-geometries endowed with alternative 'spin structures.'" More specifically: "One does not classify the closed orientable 3-manifold of physics completely
when one gives its topology, its differential structures, and its metric. One must tell which spin structure it has." [On a 3-geometry with the topology of a 3 -sphere, one can lay down a continuous field of triads (a triad consisting of three orthonormal vectors). Any other continuous field of triads can be deformed into the first field by a continuous sequence of small readjustments. One says that the 3 -sphere admits only one "spin structure," a potentially misleading standard word for what could just as well have been called a "triad structure." In contrast, a 3 -sphere with n n nnn handles or wormholes admits 2 n 2 n 2^(n)2^{n}2n "spin structures" (continuous fields of triads) inequivalent to one another under any continuous sequence of small readjustments whatsoever, and distinguished from one another in any convenient way by n n nnn "descriptors" w 1 , w 2 , w 1 , w 2 , w_(1),w_(2),dotsw_{1}, w_{2}, \ldotsw1,w2,, w k , w n w k , w n w_(k),dotsw_(n)w_{k}, \ldots w_{n}wk,wn. I It is natural in quantum geometrodynamics to expect "separate probability amplitudes for a 3-geometry with descriptor w k = + 1 w k = + 1 w_(k)=+1w_{k}=+1wk=+1 and for an otherwise identical 3 -geometry with descriptor w k = 1 w k = 1 w_(k)=-1w_{k}=-1wk=1. Does this circumstance imply that quantum geometrodynamics supplies all the machinery one needs to describe fields of spin 1 2 1 2 (1)/(2)\frac{1}{2}12 in general and the neutrino field in particular? . . . That is the only way that has ever turned up within the framework of Einstein's general relativity and Planck's quantum principle. Is this the right path? It is difficult to name any question more decisive than this in one's assessment of 'everything as geometry.'"
Why not spell out these concepts, reduce them to practice, and compare them with what one knows about the behavior of fields of spin 1 2 1 2 (1)/(2)\frac{1}{2}12 ? There is a central difficulty in this enterprise. It assumes and demands on physical grounds that the topology of the 3-geometry shall be free to change from one connectivity to another. In contrast, classical differential geometry says, in effect, "Once one topology, always that topology." Try a question like this, "When a new handle develops and the number of descriptors rises by one, what boundary condition in superspace connects the probability
amplitude ψ ψ psi\psiψ for 3-geometries of the original topology with the probability amplitudes ψ + ψ + psi_(+)\psi_{+}ψ+and ψ ψ psi_(-)\psi_{-}ψfor the two spin structures of the new topology?" Classical differential geometry not only gives one no help in answering this question; it even forbids one to ask it. In other words, one cannot even get the enterprise "on the road" for want of a natural
mathematical way to describe the required change in topology. The idea is therefore abandoned here and now that 3 -geometry is "the magic building material of the universe." In contrast, pregeometry (see text), far from being endowed with any frozen topology, is to be viewed as not even possessing any dimensionality.

Box 44.4 "A BUCKET OF DUST"-AN EARLY ATTEMPT TO FORMULATE THE CONCEPT OF PREGEOMETRY [Wheeler (1964a)]

"What line of thought could ever be imagined as leading to four dimensions-or any dimensionality at all-out of more primitive considerations? In the case of atoms one derives the yellow color of the sodium D-lines by analyzing the quantum dynamics of a system, no part of which is ever endowed with anything remotely resembling the attribute of color. Likewise any derivation of the four-dimensionality of spacetime can hardly start with the idea of dimensionality."
"Recall the notion of a Borel set. Loosely speaking, a Borel set is a collection of points ("bucket of dust") which have not yet been assembled into a manifold of any particular dimensionality. . . . Recalling the universal sway of the quantum principle, one can imagine probability amplitudes for the points in a Borel set to be assembled into points with this, that, and the other dimensionality. . . . More conditions have to be imposed on a given number of points-as to which has which for a nearest neighbor-when the points are put together in a five-dimensional array than when these same points are arranged in a two-dimensional pattern. Thus one can think of each dimensionality as having a much higher statistical weight than the next higher dimensionality. On the other hand, for manifolds with one, two, and three dimensions, the geometry is too rudimentary-one can suppose-to give anything interesting. Thus Einstein's field equations, applied to a manifold of dimensionality so low, demand flat space; only when the dimensionality is as high as four do really interesting possibilities arise. Can four,
therefore, be considered to be the unique dimensionality which is at the same time high enough to give any real physics and yet low enough to have great statistical weight?
"It is too much to imagine that one has yet made enough mistakes in this domain of thought to explore such ideas with any degree of good judgment."
Consider a handle on the geometry. Let it thin halfway along its length to a point. In other words, let the handle dissolve into two bent prongs that touch at a point. Let these prongs separate and shorten. In this process two points part company that were once immediate neighbors. "However sudden the change is in classical theory, in quantum theory there is a probability amplitude function which falls off in the classically forbidden domain. In other words, there is some residual connection between points which are ostensibly very far away (travel from one 'tip' down one prong, then through the larger space to which these prongs are attached, and then up the other prong to the other tip). But there is nothing distinctive in principle about the two points that have happened to come into discussion. Thus it would seem that there must be a connection . . . between every point and every other point. Under these conditions the concept of nearest neighbor would appear no longer to make sense. Thus the tool disappears with the help of which one might otherwise try to speak [un]ambiguously about dimensionality."
Sakharov: gravitation is the "metric elasticity of space"
The stratification of space
Comparison with everyday elasticity
two features of nature, spin 1 2 1 2 (1)/(2)\frac{1}{2}12 and charge, that speak out powerfully from every part of elementary particle physics.
A fresh perspective on pregeometry comes from a fresh assessment of general relativity. "Geometrodynamics is neither as important or as simple as it looks. Do not make it the point of departure in searching for underlying simplicity. Look deeper, at elementary particle physics." This is the tenor of interesting new considerations put forward by Sakharov [the Sakharov] (1967) and summarized under the heading, "Gravitation as the metric elasticity of space," in Box 17.2. In brief, as elasticity is to atomic physics, so-in Sakharov's view-gravitation is to elementary particle physics. The energy of an elastic deformation is nothing but energy put into the bonds between atom and atom by the deformation. The energy that it takes to curve space is nothing but perturbation in the vacuum energy of fields plus particles brought about by that curvature, according to Sakharov. The energy required for the deformation is governed in the one case by two elastic constants and in the other case by one elastic constant (the Newtonian constant of gravity) but in both cases, he reasons, the constants arise by combination of a multitude of complicated individual effects, not by a brave clean stroke on an empty slate.
One gives all the more favorable reception to Sakhorov's view of gravity because one knows today, as one did not in 1915, how opulent in physics the vacuum is. In Einstein's day one had come in a single decade from the ideal God-given Lorentz perfection of flat spacetime to curved spacetime. It took courage to assign even one physical constant to that world of geometry that had always stood so far above physics. The vacuum looked for long as innocent of structure as a sheet of glass emerging from a rolling mill. With the discovery of the positive electron [Anderson (1933)], one came to recognize a little of the life that heat can unfreeze in "empty" space. Each new particle and radiation that was discovered brought a new accretion to the recognized richness of the vacuum. Macadam looks smooth, but a bulldozer has only to cut a single furrow through the roadway to disclose all the complications beneath the surface.
Think of a particle as built out of the geometry of space; think of a particle as a "geometrodynamic exciton"? No model-it would seem to follow from Sakharov's assessment-could be less in harmony with nature, except to think of an atom as built out of elasticity! Elasticity did not explain atoms. Atoms explained elasticity. If, likewise, particles fix the constant in Einstein's geometrodynamic law (Sakharov), must it not be unreasonable to think of the geometrodynamic law as explaining particles?
Carry the comparison between geometry and elasticity one stage deeper (Fig. 44.2). In a mixed solid there are hundreds of distinct bonds, all of which contribute to the elastic constants; some of them arise from Van der Waal's forces, some from ionic coupling, some from homopolar linkage; they have the greatest variety of strengths; but all have their origin in something so fantastically simple as a system of positively and negatively charged masses moving in accordance with the laws of quantum mechanics. In no way was it required or right to meet each complication of the chemistry and physics of a myriad of bonds with a corresponding complication of principle. By going to a level of analysis deeper than bond strengths, one had
Figure 44.2.
Elasticity and geometrodynamics, as viewed at three levels of analysis. A hundred years of the study of elasticity did not reveal the existence of molecules, and a hundred years of the study of molecular chemistry did not reveal Schrödinger's equation. Revelation moved upward in the diagram, not downward.
emerged into a world of light, where nothing but simplicity and unity was to be seen.
Compare with geometry. The vacuum is animated with the zero-point activity of distinct fields and scores of distinct particles, all of which, according to Sakharov, contribute to the Newtonian G G GGG, the "elastic constant of the metric." Some interact via weak forces, some by way of electromagnetic forces, and some through strong forces. These interactions have the greatest variety of strengths. But must not all these particles and interactions have their origin in something fantastically simple? And must not this something, this "pregeometry," be as far removed from geometry as the quantum mechanics of electrons is far removed from elasticity?
If one once thought of general relativity as a guide to the discovery of pregeometry, nothing might seem more dismaying than this comparison with an older realm of physics. No one would dream of studying the laws of elasticity to uncover the principles of quantum mechanics. Neither would anyone investigate the work-hardening of a metal to learn about atomic physics. The order of understanding ran not
Work-hardening ( 1 cm ) dislocations ( 10 4 cm ) atoms ( 10 8 cm ) ,  Work-hardening  ( 1 cm )  dislocations  10 4 cm  atoms  10 8 cm " Work-hardening "(1cm)longrightarrow" dislocations "(10^(-4)(cm))longrightarrow" atoms "(10^(-8)(cm))", "\text { Work-hardening }(1 \mathrm{~cm}) \longrightarrow \text { dislocations }\left(10^{-4} \mathrm{~cm}\right) \longrightarrow \text { atoms }\left(10^{-8} \mathrm{~cm}\right) \text {, } Work-hardening (1 cm) dislocations (104 cm) atoms (108 cm)
but the direct opposite,
Atoms ( 10 8 cm ) dislocations ( 10 4 cm ) work-hardening ( 1 cm )  Atoms  10 8 cm  dislocations  10 4 cm  work-hardening  ( 1 cm ) " Atoms "(10^(-8)(cm))longrightarrow" dislocations "(10^(-4)(cm))longrightarrow" work-hardening "(1cm)\text { Atoms }\left(10^{-8} \mathrm{~cm}\right) \longrightarrow \text { dislocations }\left(10^{-4} \mathrm{~cm}\right) \longrightarrow \text { work-hardening }(1 \mathrm{~cm}) Atoms (108 cm) dislocations (104 cm) work-hardening (1 cm)
One had to know about atoms to conceive of dislocations, and had to know about dislocations to understand work-hardening. Is it not likewise hopeless to go from the "elasticity of geometry" to an understanding of particle physics, and from particle physics to the uncovering of pregeometry? Must not the order of progress again be the direct opposite? And is not the source of any dismay the apparent loss of guidance that one experiences in giving up geometrodynamics-and not only geometrodynamics, but geometry itself-as a crutch to lean on as one hobbles forward? Yet there is so much chance that this view of nature is right that one must take it seriously and explore its consequences. Never more than today does one have the incentive to explore pregeometry.

§44.5. PREGEOMETRY AS THE CALCULUS OF PROPOSITIONS

Paper in white the floor of the room, and rule it off in one-foot squares. Down on one's hands and knees, write in the first square a set of equations conceived as able to govern the physics of the universe. Think more overnight. Next day put a better set of equations into square two. Invite one's most respected colleagues to contribute to other squares. At the end of these labors, one has worked oneself out into the door way. Stand up, look back on all those equations, some perhaps more hopeful than others, raise one's finger commandingly, and give the order "Fly!" Not one of those equations will put on wings, take off, or fly. Yet the universe "flies."
Some principle uniquely right and uniquely simple must, when one knows it, be also so compelling that it is clear the universe is built, and must be built, in such and such a way, and that it could not possibly be otherwise. But how can one discover that principle? If it was hopeless to learn atomic physics by studying work-hardening and dislocations, it may be equally hopeless to learn the basic operating principle of the universe, call it pregeometry or call it what one will, by any amount of work in general relativity and particle physics.
Thomas Mann (1937), in his essay on Freud, utters what Niels Bohr would surely have called a great truth ("A great truth is a truth whose opposite is also a great truth") when he says, "Science never makes an advance until philosophy authorizes and encourages it to do so." If the equivalence principle (Chapter 16) and Mach's principle ( $ 21.9 $ 21.9 $21.9\$ 21.9$21.9 ) were the philosophical godfathers of general relativity, it is also true that what those principles do mean, and ought to mean, only becomes clear by study and restudy of Einstein's theory itself. Therefore it would seem reasonable to expect the primary guidance in the search for pregeometry to come from a principle both philosophical and powerful, but one also perhaps not destined to be wholly clear in its contents or its implications until some later day.
Among all the principles that one can name out of the world of science, it is difficult to think of one more compelling than simplicity; and among all the simplicities of dynamics and life and movement, none is starker [Werner (1969)] than the binary choice yes-no or true-false. It in no way proves that this choice for a starting principle is correct, but it at least gives one some comfort in the choice, that Pauli's "nonclassical two-valuedness" or "spin" so dominates the world of particle physics.
It is one thing to have a start, a tentative construction of pregeometry; but how does one go on? How not to go on is illustrated by Figure 44.3. The "sewing machine" builds objects of one or another definite dimensionality, or of mixed dimensionalities, according to the instructions that it receives on the input tape in yes-no binary code. Some of the difficulties of building up structure on the binary element according to this model, or any one of a dozen other models, stand out at once. (1) Why N = 10 , 000 N = 10 , 000 N=10,000N=10,000N=10,000 building units? Why not a different N N NNN ? And if one feeds in one such arbitrary number at the start, why not fix more features "by hand?" No natural stopping point is evident, nor any principle that would fix such a stopping point. Such arbitrariness contradicts the principle of simplicity and rules out the model. (2) Quantum mechanics is added from outside, not generated from inside (from the model itself). On this point too the principle of simplicity speaks against the model. (3) The passage from pregeometry to geometry is made in a too-literal-minded way, with no appreciation of the need for particles and fields to appear along the way. The model, in the words used by Bohr on another occasion, is "crazy, but not crazy enough to be right."
Noting these difficulties, and fruitlessly trying model after model of pregeometry to see if it might be free of them, one suddenly realizes that a machinery for the combination of yes-no or true-false elements does not have to be invented. It already exists. What else can pregeometry be, one asks oneself, than the calculus of propositions? (Box 44.5.)

§44.6. THE BLACK BOX: THE REPROCESSING OF THE UNIVERSE

No amount of searching has ever disclosed a "cheap way" out of gravitational collapse, any more than earlier it revealed a cheap way out of the collapse of the atom. Physicists in that earlier crisis found themselves in the end confronted with a revolutionary pistol, "Understand nothing-or accept the quantum principle." Today's crisis can hardly force a lesser revolution. One sees no alternative except to say that geometry fails and pregeometry has to take its place to ferry physics through the final stages of gravitational collapse and on into what happens next. No guide is evident on this uncharted way except the principle of simplicity, applied to drastic lengths.
Whether the whole universe is squeezed down to the Planck dimension, or more or less, before reexpansion can begin and dynamics can return to normal, may be irrelevant for some of the questions one wants to consider. Physics has long used the "black box" to symbolize situations where one wishes to concentrate on what goes in and what goes out, disregarding what takes place in between.
At the beginning of the crisis of electric collapse one conceived of the electron as headed on a deterministic path toward a point-center of attraction, and unhappily destined to arrive at a condition of infinite kinetic energy in a finite time. After the advent of quantum mechanics, one learned to summarize the interaction between
A first try at a pregeometry built on the principle of binary choice
A more reasonable picture: pregeometry is the calculus of propositions
The role of the black box in physics
Figure 44.3.
"Ten thousand rings"; or an example of a way to think of the connection between pregeometry and geometry, wrong because it is too literal-minded, and for other reasons spelled out in the text. The vizier [story by Wheeler, as alluded to by Kilmister (1971)*] speaks: "Take N = 10 , 000 N = 10 , 000 N=10,000N=10,000N=10,000 brass rings. Take an automatic fastening device that will cut open a ring, loop it through another ring, and resolder the joint. Pour the brass rings into the hopper that feeds this machine. Take a strip of instruction paper that is long enough to contain N ( N 1 ) / 2 N ( N 1 ) / 2 N(N-1)//2N(N-1) / 2N(N1)/2 binary digits. Look at the instruction in the ( j k ) ( j k ) (jk)(j k)(jk)-th location on this instruction tape ( j , k = 1 , 2 , , N ; j < k ) ( j , k = 1 , 2 , , N ; j < k ) (j,k=1,2,dots,N;j < k)(j, k=1,2, \ldots, N ; j<k)(j,k=1,2,,N;j<k). When the binary digit at that location is 0 , it is a signal to leave the j j jjj-th ring disconnected from the k k kkk-th ring. When it is 1 , it is an instruction to connect that particular pair of rings. Thread the tape into the machine and press the start button. The clatter begins. Out comes a chain of rings 10,000 links long. It falls on the table and the machine stops. Pour in another 10,000 rings, feed in a new instruction tape, and push the button again. This time it is not a one-dimensional structure that emerges, but a two-dimensional one: a Crusader's coat of mail, complete with neck opening and sleeves. Take still another tape from the library of tapes and repeat. Onto the table thuds a smaller version of the suit of mail, this time filled out internally with a solid network of rings, a three-dimensional structure. Now forego the library and make one's own instruction tape, a random string of 0's and l's. Guided by it, the fastener produces a "Christmas tree ornament," a collection of segments of one-dimensional chain, two-dimensional surfaces, and three-, four-, five-, and higher-dimensional entities, some joined together, some free-floating. Now turn from a structure deterministically fixed by a tape to a probability amplitude, a complex number,
(1) ψ ( tape ) = ψ ( n 12 , n 13 , n 14 , , n N 1 , N ) ( n i j = 0 , 1 ) (1) ψ (  tape  ) = ψ n 12 , n 13 , n 14 , , n N 1 , N n i j = 0 , 1 {:(1)psi(" tape ")=psi(n_(12),n_(13),n_(14),dots dots,n_(N-1,N))quad(n_(ij)=0,1):}\begin{equation*} \psi(\text { tape })=\psi\left(n_{12}, n_{13}, n_{14}, \ldots \ldots, n_{N-1, N}\right) \quad\left(n_{i j}=0,1\right) \tag{1} \end{equation*}(1)ψ( tape )=ψ(n12,n13,n14,,nN1,N)(nij=0,1)
defined over the entire range of possibilities for structures built of 10,000 rings. Let these probability amplitudes not be assigned randomly. Instead, couple together amplitudes, for structures that differ from each other by the breaking of a single ring, by linear formulas that treat all rings on the same footing. The separate ψ ψ psi\psiψ 's, no longer entirely independent, will still give non-zero probability amplitudes for "Christmas tree ornaments." Of greater immediate interest than these "unruly" parts of the structures are the following questions about the smoother parts: (1) In what kinds of structures is the bulk of the probability concentrated? (2) What is the dominant dimensionality of these structures in an appropriate correspondence principle limit? (3) In this semiclassical limit, what is the form taken by the dynamic law of evolution of the geometry?" No principle more clearly rules out this model for pregeometry than the principle of simplicity (see text).

Box 44.5 "PREGEOMETRY AS THE CALCULUS OF PROPOSITIONS"

A sample proposition taken out of a standard text on logic selected almost at random reads [Kneebone (1963), p. 40]
[ X ( ( X X ) Y ) ] & ( X ¯ Z ) eq ( X ¯ Y Z ) & ( X ¯ Y Z ¯ ) & ( X Y Z ) & ( X Y ¯ Z ) . [ X ( ( X X ) Y ) ] & ( X ¯ Z )  eq  ( X ¯ Y Z ) & ( X ¯ Y Z ¯ ) & ( X Y Z ) & ( X Y ¯ Z ) . {:[[X longrightarrow((X longrightarrow X)longrightarrow Y)]&( bar(X)longrightarrow Z)" eq "( bar(X)vv Y vv Z)&],[( bar(X)vv Y vv bar(Z))&(X vv Y vv Z)&(X vv bar(Y)vv Z).]:}\begin{aligned} {[X \longrightarrow} & ((X \longrightarrow X) \longrightarrow Y)] \&(\bar{X} \longrightarrow Z) \text { eq }(\bar{X} \vee Y \vee Z) \& \\ & (\bar{X} \vee Y \vee \bar{Z}) \&(X \vee Y \vee Z) \&(X \vee \bar{Y} \vee Z) . \end{aligned}[X((XX)Y)]&(X¯Z) eq (X¯YZ)&(X¯YZ¯)&(XYZ)&(XY¯Z).
The symbols have the following meaning:
A ¯ A ¯ bar(A)\bar{A}A¯, Not A ; A ; A;A ;A;
A B A B A vv BA \vee BAB, A A AAA or B B BBB or both (" A A AAA vel B B BBB ");
A & B A & B A&BA \& BA&B, A A AAA and B ; B ; B;B ;B;
A B A B A longrightarrow BA \longrightarrow BAB, A A AAA implies B B BBB ("if A A AAA, then B B BBB ");
A B A B A longleftrightarrow BA \longleftrightarrow BAB, B B BBB is equivalent to A A AAA (" B B BBB if and only if A A AAA ").
bar(A), Not A; A vv B, A or B or both (" A vel B "); A&B, A and B; A longrightarrow B, A implies B ("if A, then B "); A longleftrightarrow B, B is equivalent to A (" B if and only if A ").| $\bar{A}$, | Not $A ;$ | | :--- | :--- | | $A \vee B$, | $A$ or $B$ or both (" $A$ vel $B$ "); | | $A \& B$, | $A$ and $B ;$ | | $A \longrightarrow B$, | $A$ implies $B$ ("if $A$, then $B$ "); | | $A \longleftrightarrow B$, | $B$ is equivalent to $A$ (" $B$ if and only if $A$ "). |
Propositional formula © is said to be equivalent ("eq") to propositional formula Bif if and only if A A Alongleftrightarrow\mathfrak{A} \longleftrightarrowA is a tautology. The letters A , B A , B A,BA, BA,B, etc., serve as connectors to "wire together" one proposition with another. Proceeding in this way, one can construct propositions of indefinitely great length.
A switching circuit [see, for example, Shannon (1938) or Hohn (1966)] is isomorphic to a proposition.
Compare a short proposition or an elementary switching circuit to a molecular collision. No idea seemed more preposterous than that of Daniel Bernoulli (1733), that heat is a manifestation of molecular collisions. Moreover, a three-body encounter is difficult to treat, a four-body collision is more difficult, and a five- or more molecule system is essentially intractable. Nevertheless, mechanics acquires new elements of simplicity in the limit in which the number of molecules is very great and in which one can use the concept of density in phase space. Out of statistical mechanics in this limit come such concepts as temperature and entropy. When the temperature is well-defined, the energy of the system is not a well-defined idea; and when the energy is well-defined, the temperature is not. This complementarity is built inescapably into the principles of the subject. Thrust the finger into the flame of a match and experience a sensation like nothing else on heaven or earth; yet what happens is all a consequence of molecular collisions, early critics notwithstanding.
Any individual proposition is difficult for the mind to apprehend when it is long; and still more difficult to grasp is the content of a cluster of propositions. Nevertheless, make a statistical analysis of the calculus of propositions in the limit where the number of propositions is great and most of them are long. Ask if parameters force themselves on one's attention in this analysis (1) analogous in some small measure to the temperature and entropy of statistical mechanics but (2) so much

Box 44.5 (continued)

more numerous, and everyday dynamic in character, that they reproduce the continuum of everyday physics.
Nothing could seem so preposterous at first glance as the thought that nature is built on a foundation as ethereal as the calculus of propositions. Yet, beyond the push to look in this direction provided by the principle of simplicity, there are two pulls. First, bare-bones quantum mechanics lends itself in a marvelously natural way to formulation in the language of the calculus of propositions, as witnesses not least the book of Jauch (1968). If the quantum principle were not in this way already automatically contained in one's proposed model for pregeometry, and if in contrast it had to be introduced from outside, by that very token one would conclude that the model violated the principle of simplicity, and would have to reject it. Second, the pursuit of reality seems always to take one away from reality. Who would have imagined describing something so much a part of the here and now as gravitation in terms of curvature of the geometry of spacetime? And when later this geometry came to be recognized as dynamic, who would have dreamed that geometrodynamics unfolds in an arena so ethereal as superspace? Little astonishment there should be, therefore, if the description of nature carries one in the end to logic, the ethereal eyrie at the center of mathematics. If, as one believes, all mathematics reduces to the mathematics of logic, and all physics reduces to mathematics, what alternative is there but for all physics to reduce to the mathematics of logic? Logic is the only branch of mathematics that can "think about itself."
"An issue of logic having nothing to do with physics" was the assessment by many of a controversy of old about the axiom, "parallel lines never meet." Does it follow from the other axioms of Euclidean geometry or is it independent? "Independent," Bolyai and Lobachevsky proved. With this and the work of Gauss as a start, Riemann went on to create Riemannian geometry. Study nature, not Euclid, to find out about geometry, he advised; and Einstein went on to take that advice and to make geometry a part of physics.
"An issue of logic having nothing to do with physics" is one's natural first assessment of the startling limitations on logic discovered by Gödel (1931), Cohen (1966), and others [for a review, see, for example, Kac and Ulam (1968)]. The exact opposite must be one's assessment if the real pregeometry of the real physical world indeed turns out to be identical with the calculus of propositions.
"Physics as manifestation of logic" or "pregeometry as the calculus of propositions" is as yet [Wheeler (1971a)] not an idea, but an idea for an idea. It is put forward here only to make it a little clearer what it means to suggest that the order of progress may not be
physics pregeometry  physics   pregeometry  " physics "longrightarrow" pregeometry "\text { physics } \longrightarrow \text { pregeometry } physics  pregeometry 
but
pregeometry physics.  pregeometry   physics.  " pregeometry "longrightarrow" physics. "\text { pregeometry } \longrightarrow \text { physics. } pregeometry  physics. 
Figure 44.4.
The "black-box model" applied (1) to the scattering of an electron by a center of attraction and (2) to the collapse of the universe itself. The deterministic electron world line of classical theory is replaced in quantum theory by a probability amplitude, the wave crests of which are illustrated schematically in the diagram. The catastrophe of classical theory is replaced in quantum theory by a probability distribution of outputs. The same diagram illustrates the "black-box account" of gravitational collapse mentioned in the text. The arena of the diagram is no longer spacetime, but superspace. The incident arrow marks no longer a classical world line of an electron through spacetime, but a classical "leaf of history of geometry" slicing through superspace (Chapter 43). The wave crests symbolize no longer the electron wave function propagating through spacetime, but the geometrodynamic wave function propagating through superspace. The cross-hatched region is no longer the region where the one-body potential goes to infinity, but the region of gravitational collapse where the curvature of space goes to infinity. The outgoing waves describe no longer alternative directions for the new course of the scattered electron, but the beginnings of alternative new histories for the universe itself after collapse and "reprocessing" end the present cycle.
center of attraction and electron in a "black box:" fire in a wave-train of electrons traveling in one direction, and get electrons coming out in this, that, and the other direction with this, that, and the other well-determined probability amplitude (Figure 44.4). Moreover, to predict these probability amplitudes quantitatively and correctly, it was enough to translate the Hamiltonian of classical theory into the language of wave mechanics and solve the resulting wave equation, the key to the "black box."
A similar "black box" view of gravitational collapse leads one to expect a "probability distribution of outcomes." Here, however, one outcome is distinguished from another, one must anticipate, not by a single parameter, such as the angle of scattering of the electron, but by many. They govern, one foresees, such quantities as the size of the system at its maximum of expansion, the time from the start of this new cycle to the moment it ends in collapse, the number of particles present, and a thousand other features. The "probabilities" of these outcomes will be governed by a dynamic law, analogous to (1) the Schrödinger wave equation for the electron, or, to cite another black box problem, (2) the Maxwell equations that couple together, at a wave-guide junction, electromagnetic waves running in otherwise separate wave guides. However, it is hardly reasonable to expect the necessary dynamic law to spring forth as soon as one translates the Hamilton-Jacobi equation of general relativity (Chapter 43) into a Schrödinger equation, simply because geometrodynamics, in both its classical and its quantum version, is built on standard differential geometry. That standard geometry leaves no room for any of those quantum fluctuations in connectivity that seem inescapable at small distances and therefore also inescapable in the final stages of gravitational collapse. Not geometry, but pregeometry, must fill the black box of gravitational collapse.
Probability distribution of the outcomes of collapse
"Reprocessing" the universe
All conservation laws transcended in the collapse of the universe
Little as one knows the internal machinery of the black box, one sees no escape from this picture of what goes on: the universe transforms, or transmutes, or transits, or is reprocessed probabilistically from one cycle of history to another in the era of collapse.
However straightforwardly and inescapably this picture of the reprocessing of the universe would seem to follow from the leading features of general relativity and the quantum principle, the two overarching principles of twentieth-century physics, it is nevertheless fantastic to contemplate. How can the dynamics of a system so incredibly gigantic be switched, and switched at the whim of probability, from one cycle that has lasted 10 11 10 11 10^(11)10^{11}1011 years to one that will last only 10 6 10 6 10^(6)10^{6}106 years? At first, only the circumstance that the system gets squeezed down in the course of this dynamics to incredibly small distances reconciles one to a transformation otherwise so unbelievable. Then one looks at the upended strata of a mountain slope, or a bird not seen before, and marvels that the whole universe is incredible:
mutation of a species, metamorphosis of a rock, chemical transformation, spontaneous transformation of a nucleus, radioactive decay of a particle, reprocessing of the universe itself.
If it cast a new light on geology to know that rocks can be raised and lowered thousands of meters and hundreds of degrees, what does it mean for physics to think of the universe as being from time to time "squeezed through a knothole," drastically "reprocessed," and started out on a fresh dynamic cycle? Three considerations above all press themselves on one's attention, prefigured in these compressed phrases:
destruction of all constants of motion in collapse; particles, and the physical "constants" themselves, as the
"frozen-in part of the meteorology of collapse;"
"the biological selection of physical constants."
The gravitational collapse of a star, or a collection of stars, to a black hole extinguishes all details of the system (see Chapters 32 and 33 ) except mass and charge and angular momentum. Whether made of matter or antimatter or radiation, whether endowed with much entropy or little entropy, whether in smooth motion or chaotic turbulence, the collapsing system ends up as seen from outside, according to all indications, in the same standard state. The laws of conservation of baryon number and lepton number are transcended [Chapter 33; also Wheeler (1971b)]. No known means whatsoever will distinguish between black holes of the most different provenance if only they have the same mass, charge, and angular momentum. But for a closed universe, even these constants vanish from the scene. Total charge is automatically zero because lines of force have nowhere to end except upon charge. Total mass and total angular momentum have absolutely no definable meaning whatsoever for a closed universe. This conclusion follows not least because there
is no asymptotically flat space outside where one can put a test particle into Keplerian orbit to determine period and precession.
Of all principles of physics, the laws of conservation of charge, lepton number, baryon number, mass, and angular momentum are among the most firmly established. Yet with gravitational collapse the content of these conservation laws also collapses. The established is disestablished. No determinant of motion does one see left that could continue unchanged in value from cycle to cycle of the universe. Moreover, if particles are dynamic in construction, and if the spectrum of particle masses is therefore dynamic in origin, no option would seem left except to conclude that the mass spectrum is itself reprocessed at the time when "the universe is squeezed through a knot hole." A molecule in this piece of paper is a "fossil" from photochemical synthesis in a tree a few years ago. A nucleus of the oxygen in this air is a fossil from thermonuclear combustion at a much higher temperature in a star a few 10 9 10 9 10^(9)10^{9}109 years ago. What else can a particle be but a fossil from the most violent event of all, gravitational collapse?
That one geological stratum has one many-miles long slope, with marvelous linearity of structure, and another stratum has another slope, is either an everyday triteness, taken as for granted by every passerby, or a miracle, until one understands the mechanism. That an electron here has the same mass as an electron there is also a triviality or a miracle. It is a triviality in quantum electrodynamics because it is assumed rather than derived. However, it is a miracle on any view that regards the universe as being from time to time "reprocessed." How can electrons at different times and places in the present cycle of the universe have the same mass if the spectrum of particle masses differs between one cycle of the universe and another? Inspect the interior of a particle of one type, and magnify it up enormously, and in that interior see one view of the whole universe [compare the concept of monad of Leibniz (1714), "The monads have no window through which anything can enter or depart"]; and do likewise for another particle of the same type. Are particles of the same pattern identical in any one cycle of the universe because they give identically patterned views of the same universe? No acceptable explanation for the miraculous identity of particles of the same type has ever been put forward. That identity must be regarded, not as a triviality, but as a central mystery of physics.
Not the spectrum of particle masses alone, but the physical "constants" themselves, would seem most reasonably regarded as reprocessed from one cycle to another. Reprocessed relative to what? Relative, for example, to the Planck system of units,
L = ( G / c 3 ) 1 / 2 = 1.6 × 10 33 cm , T = ( G / c 5 ) 1 / 2 = 5.4 × 10 44 sec , M = ( c / G ) 1 / 2 = 2.2 × 10 5 g , L = G / c 3 1 / 2 = 1.6 × 10 33 cm , T = G / c 5 1 / 2 = 5.4 × 10 44 sec , M = ( c / G ) 1 / 2 = 2.2 × 10 5 g , {:[L^(**)=(ℏG//c^(3))^(1//2)=1.6 xx10^(-33)cm","],[T^(**)=(ℏG//c^(5))^(1//2)=5.4 xx10^(-44)sec","],[M^(**)=(ℏc//G)^(1//2)=2.2 xx10^(-5)g","]:}\begin{aligned} L^{*} & =\left(\hbar G / c^{3}\right)^{1 / 2}=1.6 \times 10^{-33} \mathrm{~cm}, \\ T^{*} & =\left(\hbar G / c^{5}\right)^{1 / 2}=5.4 \times 10^{-44} \mathrm{sec}, \\ M^{*} & =(\hbar c / G)^{1 / 2}=2.2 \times 10^{-5} \mathrm{~g}, \end{aligned}L=(G/c3)1/2=1.6×1033 cm,T=(G/c5)1/2=5.4×1044sec,M=(c/G)1/2=2.2×105 g,
the only system of units, Planck (1899) pointed out, free, like black-body radiation itself, of all complications of solid-state physics, molecular binding, atomic constitution, and elementary particle structure, and drawing for its background only on the simplest and most universal principles of physics, the laws of gravitation and blackbody radiation. Relative to the Planck units, every constant in every other part of physics is expressed as a pure number.
Three hierarchies of fossils: molecules, nuclei, particles
Reason for identity in mass of particles of the same species?
Reprocessing of physical constants
No pure numbers in physics are more impressive than c / e 2 = 137.0360 c / e 2 = 137.0360 ℏc//e^(2)=137.0360\hbar c / e^{2}=137.0360c/e2=137.0360 and the so-called "big numbers" [Eddington (1931, 1936, 1946); Dirac (1937, 1938); Jordan (1955, 1959); Dicke (1959b, 1961, 1964b); Hayakawa (1965a,b); Carter (1968b)]:
10 80 particles in the universe,* 10 40 10 28 cm 10 12 cm ( radius of universe at maximum expansion ) ( "size" of an elementary particle , 10 40 e 2 G m M ( electric forces) ( gravitational forces) , 10 20 e 2 / m c 2 ( G / c 3 ) 1 / 2 ( size" of an elementary particle (Planck length) , 10 10 ( number of photons in universe ) ( number of baryons in universe . 10 80  particles in the universe,*  10 40 10 28 cm 10 12 cm (  radius of universe at   maximum expansion  )  "size" of an elementary   particle  , 10 40 e 2 G m M (  electric forces)  (  gravitational forces)  , 10 20 e 2 / m c 2 G / c 3 1 / 2  size" of an elementary   particle   (Planck length)  , 10 10 (  number of photons   in universe  )  number of baryons   in universe  . {:[∼10^(80)" particles in the universe,* "],[∼10^(40)∼(10^(28)(cm))/(10^(-12)(cm))∼(((" radius of universe at ")/(" maximum expansion "))^(**))/(([" "size" of an elementary "],[" particle "])","],[∼10^(40)∼(e^(2))/(GmM)∼((" electric forces) ")/((" gravitational forces) ",)],[∼10^(20)∼(e^(2)//mc^(2))/((ℏG//c^(3))^(1//2))∼(([" size" of an elementary "],[" particle "])/(" (Planck length) ")","],[∼10^(10)∼(((" number of photons ")/(" in universe ")))/(([" number of baryons "],[" in universe "]).]:}\begin{aligned} & \sim 10^{80} \text { particles in the universe,* } \\ & \sim 10^{40} \sim \frac{10^{28} \mathrm{~cm}}{10^{-12} \mathrm{~cm}} \sim \frac{\binom{\text { radius of universe at }}{\text { maximum expansion }}^{*}}{\left(\begin{array}{l} \text { "size" of an elementary } \\ \text { particle } \end{array}\right.}, \\ & \sim 10^{40} \sim \frac{e^{2}}{G m M} \sim \frac{(\text { electric forces) }}{(\text { gravitational forces) },} \\ & \sim 10^{20} \sim \frac{e^{2} / m c^{2}}{\left(\hbar G / c^{3}\right)^{1 / 2}} \sim \frac{\left(\begin{array}{l} \text { size" of an elementary } \\ \text { particle } \end{array}\right.}{\text { (Planck length) }}, \\ & \sim 10^{10} \sim \frac{\binom{\text { number of photons }}{\text { in universe }}}{\left(\begin{array}{l} \text { number of baryons } \\ \text { in universe } \end{array}\right.} . \end{aligned}1080 particles in the universe,* 10401028 cm1012 cm( radius of universe at  maximum expansion )( "size" of an elementary  particle ,1040e2GmM( electric forces) ( gravitational forces) ,1020e2/mc2(G/c3)1/2( size" of an elementary  particle  (Planck length) ,1010( number of photons  in universe )( number of baryons  in universe .
Some understanding of the relationships between these numbers has been won [Carter (1968b)]. Never has any explanation appeared for their enormous magnitude, nor will there ever, if the view is correct that reprocessing the universe reprocesses also the physical constants. These constants on that view are not part of the laws of physics. They are part of the initial-value data. Such numbers are freshly given for each fresh cycle of expansion of the universe. To look for a physical explanation for the "big numbers" would thus seem to be looking for the right answer to the wrong question.
In the week between one storm and the next, most features of the weather are ever-changing, but some special patterns of the wind last the week. If the term "frozen features of the meteorology" is appropriate for them, much more so would it seem appropriate for the big numbers, the physical constants and the spectrum of particle masses in the cycle between one reprocessing of the universe and another.
A per cent or so change one way in one of the "constants," c / e 2 c / e 2 ℏc//e^(2)\hbar c / e^{2}c/e2, will cause all stars to be red stars; and a comparable change the other way will make all stars be blue stars, according to Carter (1968b). In neither case will any star like the sun be possible. He raises the question whether life could have developed if the determinants of the physical constants had differed substantially from those that characterize this cycle of the universe.
Dicke (1961) has pointed out that the right order of ideas may not be, here is the universe, so what must man be; but here is man, so what must the universe
be? In other words: (1) What good is a universe without awareness of that universe? But: (2) Awareness demands life. (3) Life demands the presence of elements heavier than hydrogen. (4) The production of heavy elements demands thermonuclear combustion. (5) Thermonuclear combustion normally requires several 10 9 10 9 10^(9)10^{9}109 years of cooking time in a star. (6) Several 10 9 10 9 10^(9)10^{9}109 years of time will not and cannot be available in a closed universe, according to general relativity, unless the radius-at-maximumexpansion of that universe is several 10 9 10 9 10^(9)10^{9}109 light years or more. So why on this view is the universe as big as it is? Because only so can man be here!
In brief, the considerations of Carter and Dicke would seem to raise the idea of the "biological selection of physical constants." However, to "select" is impossible unless there are options to select between. Exactly such options would seem for the first time to be held out by the only over-all picture of the gravitational collapse of the universe that one sees how to put forward today, the pregeometry black-box model of the reprocessing of the universe.
Proceeding with all caution into uncharted territory, one must nevertheless be aware that the conclusions one is reaching and the questions one is asking at a given stage of the analysis may be only stepping stones on the way to still more penetrating questions and an even more remarkable picture. To speak of "reprocessing and selection" may only be a halfway point on the road toward thinking of the universe as Leibniz did, as a world of relationships, not a world of machinery. Far from being brought into its present condition by "reprocessing" from earlier cycles, may the universe in some strange sense be "brought into being" by the participation of those who participate? On this view the concept of "cycles" would even seem to be altogether wrong. Instead the vital act is the act of participation. "Participator" is the incontrovertible new concept given by quantum mechanics; it strikes down the term "observer" of classical theory, the man who stands safely behind the thick glass wall and watches what goes on without taking part. It can't be done, quantum mechanics says. Even with the lowly electron one must participate before one can give any meaning whatsoever to its position or its momentum. Is this firmly established result the tiny tip of a giant iceberg? Does the universe also derive its meaning from "participation"? Are we destined to return to the great concept of Leibniz, of "preestablished harmony" ("Leibniz logic loop"), before we can make the next great advance?
Rich prospects stand open for investigation in gravitation physics, from neutron stars to cosmology and from post-Newtonian celestial mechanics to gravitational waves. Einstein's geometrodynamics exposes itself to destruction on a dozen fronts and by a thousand predictions. No predictions subject to early test are more entrancing than those on the formation and properties of a black hole, "laboratory model" for some of what is predicted for the universe itself. No field is more pregnant with the future than gravitational collapse. No more revolutionary views of man and the universe has one ever been driven to consider seriously than those that come out of pondering the paradox of collapse, the greatest crisis of physics of all time.
Black hole as "laboratory" model for collapse of universe

"Omnibus ex nihil ducendis sufficit unum!" (One suffices to create Everything of nothingl) Gottrried wilhelm von leibniz
(Sur l'air de Auprès de ma blonde)
Dans les jardins d'Asnières La science a refleuri
Tous les savants du monde
Apportent leurs écrits

Refrain:

Auprès de nos ondes Qu'il fait bon, fait bon, fait bon Auprès de nos ondes Qu'il fait bon rêver
Tous les savants du monde Apportent leurs écrits Loi gravitationnelle Sans tenseur d'énergie
Loi gravitationnelle
Sans tenseur d'énergie
De ravissants modèles
Pour la cosmologie
De ravissants modèles
Pour la cosmologie
Pour moi ne m'en faut guère
Car j'en ai un joli
Pour moi ne m'en faut guère Car j'en ai un joli
// est dans ma cervelle
Voici mon manuscrit
Le champ laisse des plumes Aux bosses de l'espace-temps En prendrons quelques unes Pour décrire le mouvement
C. CATTANEO, J. GÉHÉNIAU
M. MAVRIDES, and M. A. TONNELAT
Reprinied with the kind pemisssion of the guthors.
When Arthur Evans began this excavation neither he nor anyone knew that he would uncover an unknown world.
PAT (Mrs. Hypatia Vourloumis at Knossos (1971))
And as imagination bodies forth
The form of things unknown, the poet's pen
Turns them to shapes, and gives to airy nothing A local habitation and a name.
SHAKESPEARE
Appreciation and farewell to our patient reader.
Chaces ho Niene Kips. Jhome

  1. *See Thorne and Will (1971), or Will (1972), for expositions of both frameworks and a comparison of them.
  2. *R. H. Dicke calls this principle "The weak equivalence principle." We prefer to avoid confusion with the equivalence principle (Chapter 16).
    \dagger In general relativity, one often uses an alternative definition of test body, which places no constraint on the self-gravitational energy [abandon condition (1) while retaining (2)]. Such a definition is preferable, in principle, because the theory of matter has not been developed sufficiently to decide whether (and no objective test has ever been proposed to decide whether), gravitational energy at the subnuclear scale is a small fraction, a large fraction, or the entirety of the rest mass. But for present purposes a definition constraining test bodies to have M / R 1 M / R 1 M//R≪1M / R \ll 1M/R1 is preferable for two reasons. First, most theories of gravity that currently "compete" with Einstein's (a) agree with the principle of uniqueness of free fall when the macroscopic, Newtonian, self-gravitational energy is neglected ( M / R 1 ) ( M / R 1 ) (M//R≪1)(M / R \ll 1)(M/R1), but (b) violate that principle when macroscopic, Newtonian self-gravitational energy is taken into account. See $ 40.9 $ 40.9 $40.9\$ 40.9$40.9 for details. Second, the test bodies used in the Eobtvös-Dicke experiment have M / R M / R M//RM / RM/R so small that their macroscopic, Newtonian, self-gravitational energies are, in fact, negligible ( M / R E grav / M 10 27 M / R E grav  / M 10 27 M//R∼E_("grav ")//M∼10^(-27)M / R \sim E_{\text {grav }} / M \sim 10^{-27}M/REgrav /M1027 ).
  3. *For a 2 per cent test of time dilation with muons of ( 1 v 2 ) 1 / 2 12 1 v 2 1 / 2 12 (1-v^(2))^(-1//2)∼12\left(1-v^{2}\right)^{-1 / 2} \sim 12(1v2)1/212 in a storage ring, see Farley, Bailey, Brown, Giesch, Jöstlein, van der Meer, Picasso, and Tannenbaum (1966). For earlier time-dilation experiments see Frisch and Smith (1963); Durbin, Loar, and Havens (1952); and Rossi and Hall (1941).
    \dagger See p. 18 of Lichtenberg (1965) for a discussion of Lorentz invariance, spin and statistics, the TCP theorem, and relevant experiments.
    • The experiment of Farley et al, is a 2 percent check of acceleration-independence of the muon decay rate for energies E / m = ( 1 v 2 ) 1 / 2 12 E / m = 1 v 2 1 / 2 12 E//m=(1-v^(2))^(-1//2)∼12E / m=\left(1-v^{2}\right)^{-1 / 2} \sim 12E/m=(1v2)1/212 and for accelerations, as measured in the muon rest frame, of a = 5 × 10 20 cm / sec 2 = 0.6 cm 1 a = 5 × 10 20 cm / sec 2 = 0.6 cm 1 a=5xx10^(20)cm//sec^(2)=0.6cm^(-1)a=5 \times 10^{20} \mathrm{~cm} / \mathrm{sec}^{2}=0.6 \mathrm{~cm}^{-1}a=5×1020 cm/sec2=0.6 cm1.
  4. *For a review of other, less-precise redshift experiments, see Bertotti, Brill, and Krotkov (1962).
  5. *For a slightly narrower definition of metric theories, see Thorne and Will (1971).
  6. a ^("a "){ }^{\text {a }} These heuristic descriptions are based on equations (39.23).
    b b ^(b){ }^{b}b For expositions of these theories see Box 39.1. For derivation of their PPN values and of PPN values for other theories, see Ni (1972).
  7. *See, e.g., a long series of papers by Chandrasekhar and his associates in the Astrophysical Journal, beginning with Chandrasekhar (1965a,b,c).
  8. *In the solar system, post-Newtonian corrections due to anisotropic stresses are so much smaller than other post-Newtonian corrections that there is no hope of measuring them in the 1970's. For this reason, elsewhere in the literature (but not in this book) the PPN formalism treats all stresses at the post-Newtonian level as isotropic pressures, thereby setting to zero the PPN parameter η η eta\etaη of $ § 39.8 39.11 $ § 39.8 39.11 $§39.8-39.11\$ \S 39.8-39.11$§39.839.11.
  9. *WARNING: Throughout the literature the notation Φ Φ Phi\PhiΦ is used where we use Ψ Ψ Psi\PsiΨ for the functional ( 39.23 g ) ( 39.23 g ) (39.23g)(39.23 \mathrm{~g})(39.23 g), and ϕ ϕ phi\phiϕ is used for our ψ ψ psi\psiψ. We are forced to violate the standard notation to avoid confusion with the Newtonian potential Φ = U Φ = U Phi=-U\Phi=-UΦ=U. However, we urge that nobody else violate the standard notation!
    • Of course, from the point of view of Einstein's full general relativity theory, all that legalistically counts is the one and only curved-spacetime geometry of the real physical world. All these "individual fields" are mere bookkeepers' discourse, and they are best abandoned (they cease to be useful) when one passes from the post-Newtonian limit to the full Einstein theory.
  10. a ^("a "){ }^{\text {a }} Here (observed delay)/(Einstein prediction) is the value of 1 2 ( 1 + γ ) 1 2 ( 1 + γ ) (1)/(2)(1+gamma)\frac{1}{2}(1+\gamma)12(1+γ) obtained by fitting the observational data, Δ τ ( τ ) Δ τ ( τ ) Delta tau(tau)\Delta \tau(\tau)Δτ(τ), to a more sophisticated version of the PPN prediction (40.14). This more sophisticated version includes the gravitational influences of all the planets on the orbits of reflector and Earth: also the effect of the moon on the Earth's orbit and the effect of the Earth's rotation on the travel time; also, to as good an extent as possible, the delay due to dispersion in the solar corona and wind. "Formal standard error" and "one-sigma error" are defined in the table in Box 40.1.
  11. *This analogy can be made mathematically rigorous; see footnote on p. 255 of Thorne (1971); see also, $ 21.12 $ 21.12 $21.12\$ 21.12$21.12 on Mach's principle.
  12. *The dragging of inertial frames by a rotating body plays important roles elsewhere in gravitation physics, e.g., in the definition of angular momentum for a gravitating body ( $ 19.2 $ 19.2 $19.2\$ 19.2$19.2 ), and in black-hole physics (Chapter 33). The effect was first discussed and calculated by Thirring and Lense (1918). More recent calculations by Brill and Cohen (1966) of idealized situations where the effect may be large give insight into the mechanism of the effect. See also the discussion of Mach's principle in $ 21.12 $ 21.12 $21.12\$ 21.12$21.12.
  13. *See any standard textbook for a description of Cavendish experiments. By his original version of the experiment, with two separated spheres suspended by fine wires, Henry Cavendish (1798) inferred the mass and hence the density of the Earth. He reported: "By a mean of the experiments made with the wire first used, the density of the Earth comes out 5.48 times greater than that of water; and by a mean of those made with the latter wire it comes out the same; and ... the extreme results do not differ from the mean more than 0.38 , or 1 / 14 1 / 14 1//141 / 141/14 of the whole." The most precise method of measuring G G GGG today [Rose et al. (1969)] gives G C = ( 6.674 ± .004 ) × 10 8 cm 3 / g sec 2 G C = ( 6.674 ± .004 ) × 10 8 cm 3 / g sec 2 G_(C)=(6.674+-.004)xx10^(-8)cm^(3)//gsec^(2)G_{\mathrm{C}}=(6.674 \pm .004) \times 10^{-8} \mathrm{~cm}^{3} / \mathrm{g} \mathrm{sec}^{2}GC=(6.674±.004)×108 cm3/gsec2 (one standard deviation).
  14. *In the same city on June 21, 1972 President Eamon de Valera told one of the authors that, while in jail one evening in 1916, scheduled to be shot the next morning, he wrote down the formula of which he was so fond, i 2 = j 2 = k 2 = i j k = 1 i 2 = j 2 = k 2 = i j k = 1 i^(2)=j^(2)=k^(2)=ijk=-1\boldsymbol{i}^{2}=\boldsymbol{j}^{2}=\boldsymbol{k}^{2}=\boldsymbol{i} \boldsymbol{j} \boldsymbol{k}=-1i2=j2=k2=ijk=1.
  15. *An "active" transformation changes one vector into another, while leaving unchanged the underlying reference frame (if there is one). By contrast, a "passive" transformation leaves all vectors unchanged, but alters the reference frame. All transformations in previous chapters of this book were passive.
  16. ^(**){ }^{*} Actually S S ADM 16 π S true = 16 π S S ADM  16 π S true  = 16 π S-=S_("ADM ")-=16 piS_("true ")=16 piS \equiv S_{\text {ADM }} \equiv 16 \pi S_{\text {true }}=16 \piSSADM 16πStrue =16π (true dynamic path length).
  17. The Marchon lecture given by J. A. W. at the University of Newcastle-upon-Tyne, May 18, 1971, and the Nuffield lecture given at Cambridge University July 19, 1971, were based on the material presented in this chapter.
  18. *Wheeler's story about the vizier and what the vizier had to say about superspace was told at the May 18, 1970, Gwatt Seminar on the Bearings of Topology upon General Relativity. Kilmister's (1971) published article alludes to the unpublished story, but does not actually contain it.
  19. *Values based on the "typical cosmological model" of Box 27.4; subject to much uncertainty, in the present state of astrophysical distance determinations, not least because the latitude in these numbers is even enough to be compatible with an open universe.
  20. All of these endeavors are based on the belief that existence should have a completely harmonious structure. Today we have less ground than ever before for allowing ourselves to be forced away from this wonderful belief.